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Abstract 

In this paper, we propose a novel class of Nash problems for Cognitive Radio (CR) networks, modeled as 
Gaussian frequency-selective interference channels, wherein each secondary user (SU) competes against 
the others to maximize his own opportunistic throughput by choosing jointly the sensing duration, 
the detection thresholds, and the vector power allocation. The proposed general formulation allows 
to accommodate several (transmit) power and (deterministic/probabilistic) interference constraints, 
such as constraints on the maximum individual and/or aggregate (probabilistic) interference tolerable 
at the primary receivers. To keep the optimization as decentralized as possible, global (coupling) 
interference constraints are imposed by penalizing each SU with a set of time-varying prices based 
upon his contribution to the total interference; the prices are thus additional variable to optimize. The 
resulting players' optimization problems are nonconvex; moreover, there are possibly price clearing 
conditions associated with the global constraints to be satisfied by the solution. All this makes the 
analysis of the proposed games a challenging task; none of classical results in the game theory literature 
can be successfully applied. 

The main contribution of this paper is to develop a novel optimization-based theory for studying the 
proposed nonconvex games; we provide a comprehensive analysis of the existence and uniqueness of 
a standard Nash equilibrium, devise alternative best-response based algorithms, and establish their 
convergence. Some of the proposed algorithms are totally distributed and asynchronous, whereas some 
others require limited signaling among the SUs (in the form of consensus algorithms) in favor of better 
performance; overall, they are thus applicable to a variety of CR scenarios, either cooperative or non- 
cooperative, which allows the SUs to explore the existing trade-off between signaling and performance. 

1 Introduction 

Over the past decade, there has been a growing interest in Cognitive Radio (CR) as an emerging paradigm 
to address the de jure shortage of allocated spectrum that contrasts with the de facto abundance of unused 
spectrum in virtually any spatial location at almost any given time. The paradigm posits that so-called 
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cognitive radios [also termed as secondary users (SUs)] would use licensed spectrum in an ad-hoc fashion 
in such a way as to cause no harmful interference to the primary spectrum license holders [also termed as 
primary users (PUs)]. Evidently, such an opportunistic spectrum access is intertwined with the design of 
multiple secondary system components, such as (but not limited to) spectrum sensing and transmission 
parameters adaptation. Indeed, the choice of the sensing parameters (e.g., the detection thresholds and 
the sensing duration) as well as the consequent design of the physical layer transmission strategies (e.g., 
the transmission rate, the power allocation) have both a direct impact on the performance of primary and 
secondary systems. The interplay between these two interacting components calls for a joint optimization 
of the sensing and transmission parameters of the SUs, which is the main focus of this paper. 

1.1 Motivation and related work 

The joint optimization of the sensing and transmission strategies has been only partially addressed in 
the literature, even for simple CR scenarios composed of one PU and one SU. For example, in [H [2], 
the authors proposed alternative centralized schemes that optimize the detection thresholds for a bank of 
energy detectors, in order to maximize the opportunistic throughput of a SU, for a given sensing time and 
constant-rate/power transmissions. The optimization of the sensing time and the sensing time/detection 
thresholds for a given missed detection probability and constant rate of one SU was addressed in [3l H] 
and [5] , respectively. A throughput-sensing trade-off for a fixed transmission rate was studied in [6] . In [7] 
(or [8]) the authors focused on the joint optimization of the power allocation and the equi-false alarm rate 
(or the sensing time) of a SU over multi-channel links, for a fixed sensing time (or detection probability). 
All the aforementioned schemes however are not applicable to scenarios composed of multiple SUs (and 
PUs). The case of multiple SUs and one PU was considered in |9] (and more recently in |10|). under the 
same assumptions of however no formal analysis of the proposed formulation was provided. 

The transceiver design of OFDM-based CR systems composed of multiple primary and secondary 
users have been largely studied in the literature of power control problems over the interference channel, 
and have been traditionally approached from two very different perspectives: a holistic design of the 
system and an individual selfish design of each of the users. The former is also referred to as Network 
Utility Maximization (NUM) (other approaches within this perspective are based on Nash bargaining 
formulations) and has the potential of obtaining the best of the network at the expense of a centralized 
computation or heavy signaling/cooperation among the users; examples are |1 H [T2 1 [T3 | I14 1 [T5 1 I16 | I17|. The 
latter fits perfectly within the mathematical framework of Game Theory and usually leads to distributed 
algorithms at the expense of a loss of global performance; related papers are \18\ \T9\ [20l |2H |22| I23| . and 
two recent overviews are |24|I25|. In both the aforementioned approaches and classes of papers the sensing 
process is not considered as part of the optimization; in fact the SUs do not perform any sensing but 
they are allowed to transmit over the licensed spectrum provided that they satisfy interference constraints 
imposed by the PUs, no matter if the PUs are active of not. 

When the sensing comes explicitly into the system design, the application of the holistic approach 
mentioned above leads to nonconvex NP hard optimization problems. These cases cannot be globally solved 
by efficient algorithms in polynomial time; one typically can design (centralized) sub-optimal algorithms 
that converge just to a stationary solution. Their implementation however would require heavy signaling 
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among the users (or the presence of a centrahzed network controller having the knowledge of all the 
system parameters); which strongly limits the range of applicability of such formulations to practical CR 
networks. For these reasons, in this paper, we attack the multi-agent decision making problem from a 
different perspective; we concentrate on optimization strategies where the SUs are able to self-enforce the 
negotiated agreements on the usage of the licensed spectrum either in a totally decentralized way or by 
requiring limited and local signaling among the SUs (in the form of consensus algorithms). Aiming at 
exploring the trade-off between signaling and performance, the proposed approach is then expected to be 
more flexible than classical optimization techniques and applicable to a wider range of CR scenarios. 

1.2 Main contributions 

This paper along with our companion work [26] advances the current approaches (based on the optimization 
of specific components of a CR system in isolation), in the direction of a joint and distributed design of 
sensing and transmission parameters of a CR network, composed of multiple PUs and SUs. 

We study a novel class of Nash equilibrium problems as proposed in |26j . wherein each SU aims at 
maximizing his own opportunistic throughput by jointly optimizing the sensing parameters— the sensing 
time and the false alarm rate (and thus the decision thresholds) of a bank of energy detectors— and the 
power allocation over the multi-channel links. Because of sensing errors, the SUs might access the licensed 
spectrum when it is still occupied by active PUs, thus causing harmful interference. This motivates the 
introduction of probabilistic interference constraints that are imposed to control the power radiated over 
the licensed spectrum whenever a missed detection event occurs (in a probabilistic sense). The proposed 
formulation accommodates alternative combinations of power/interference constraints. For instance, on 
top of classical (deterministic) transmit power (and possibly spectral masks) constraints, we envisage the 
use of average individual (i.e., on each SU) and/or global (i.e., over all the SUs) interference tolerable at 
the primary receivers. The former class of constraints is more suitable for scenarios where the SUs are 
not willing to cooperate; whereas the latter constraints, which are less conservative, seem more realistic 
in settings where SUs may want to trade some limited signaling for better performance. By imposing a 
coupling among the transmit and sensing strategies of the SUs, global interference constraints introduce 
a new challenge in the system design: how to enforce global interference constraints without requiring 
a centralized optimization but possibly only limited signaling among the SUs? We address this issue by 
introducing a pricing mechanism in the game, through a penalization in the players' objective functions. 
The prices need to be chosen so that the interference constraints are satisfied at any solution of the game 
and a clearing condition holds; they are thus additional variables to be determined. 

The resulting class of games is nonconvex (because of the nonconvexity of the players' payoff functions 
and constraints), lacks boundedness in the price variables, and there are side constraints with associated 
price equilibration that are required to be satisfied by the equilibrium; all these features make the analysis 
a challenging task. The convexity of the players' individual optimization problems is, in fact, one indis- 
pensable assumption under which noncooperative games have traditionally been studied and analyzed. 
The classical case where a NE exists is indeed when the players' objective functions are (quasi-) convex in 
their own variables with the other players' strategies fixed, and the players' constraint sets are compact 
and convex and independent of their rivals' strategies (see, e.g., |27| 128)). Without such convexity, a NE 
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may not exist (as in the well-known case of a matrix game with pure strategies); analytically, abstract 
mathematical theories granting its existence, like those in [29\ [3D], are difficult to be applied to games 
arising from realistic applications such as those occurred in the present paper. 

The main contribution of this work is to develop a novel optimization-based theory for the solution 
analysis of the proposed class of nonconvex games (possibly) with side constraints and price clearing con- 
ditions, and to design distributed best-response based algorithms for computing the Nash equilibria, along 
with their convergence properties. Building on |31| . the solution analysis is addressed by introducing a 
"best-response" map (including price variables) defined on a proper convex and compact set, whose fixed- 
points, if they exist, are Nash equilibria of the original nonconvex games; the obtained conditions are in 
fact sufficient for such a map to be a single-valued continuous map; this enables the application of the 
Brouwer fixed-point theorem to deduce the existence of a fixed-point of the best-response map, thus of a 
NE of the whole class of proposed games. While seemingly very simple, the technical details lie in deriving 
(reasonable) conditions for which the best-response map is single- valued and for the boundedness of the 
prices in order for the existence of a compact set on which the Brouwer result can be based. Interestingly, 
the obtained conditions have the same physical interpretation of those obtained for the convergence of 
the renowned iterative waterfilling algorithm solving the power control game over interference channels 
|18 | I19 1 [20 | I2 H I22). We then focus on solutions schemes for the proposed class of games; we design alterna- 
tive distributed (possibly) asynchronous best-response based algorithms that differ in performance, level 
of protection of the PUs, computational effort and degree of cooperation/signaling among the SUs, and 
convergence speed; which makes them applicable to a variety of CR scenarios (either cooperative or nonco- 
operative). For each algorithm, we establish its convergence and also quantify the time and communication 
costs for its implementation. Our numerical results show that: i) the proposed joint sensing/transmission 
optimization outperforms current centralized and decentralized state-of-the-art results based on separated 
optimization of the sensing and the transmission parts; ii) our algorithms exhibit a fast convergence behav- 
ior; and iii) as expected, some (limited) cooperation among the SUs (in the form of consensus algorithms) 
yields a significant improvement in the system performance. The proposed solution schemes can also be 
used to compute the so-called Quasi-NE of the associated games, a relaxed equilibrium concept introduced 
and studied in our companion paper |26| . 

The paper is organized as follows. Sec. [2] briefly introduces the system model, as proposed in |26| : 
Sec. [3] focuses on the system design and formulates the joint optimization of the sensing parameters and 
the power allocation of the SUs within the framework of game theory; several games are introduced. The 
solution analysis of the proposed games is addressed in Sec. HI where sufficient conditions for the existence 
and uniqueness of a standard NE along with their interpretation are derived. Distributed algorithms 
solving the proposed games along with their convergence properties and computational/communication 
complexity are studied in Sec. El Numerical experiments are reported in Sec. [6l whereas Sec. [7] draws 
the conclusions. Proofs of our results are given in Appendix IAUFI The paper requires a background 
on Variational Inequalities (Vis); we refer to \32\ I33| for an introductory overview of the subject and its 
application to equilibrium problems in signal processing and communications. A comprehensive treatment 
of Vis can be found in the two monographs |34| [35] ; a detailed study of convex games based on the VI 
and complementarity approach is addressed in |36| I22|. The main properties of Z and P matrices, which 
are widely used in the paper, can be found in |34| 137) . 
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2 System Model 

We consider a scenario composed of Q active SUs, each consisting of a transmitter-receiver pair, coexisting 
in the same area and sharing the same band with PUs. The network of the SUs is modeled as an A^- 
frequency-selective SISO Interference Channel (IC), where N is the number of subcarriers available to 
the cognitive users. We focus on multicarrier block-transmissions without loss of generality. In order not 
to interfere with on-going PU transmissions, before transmitting, the SUs sense periodically the licensed 
spectrum looking for the subcarriers that are temporarily not occupied by the PUs. A brief description of 
the sensing mechanism and transmission phase performed by the SUs as proposed in the companion paper 
|26) is given in the following, where we introduce the basic definitions and notation used throughout the 
paper; we refer the reader to |25) for details and the assumptions underlying the proposed model. 

2.1 The spectrum sensing phase 

In |26) . we formulated the sensing problem as a binary hypothesis testing; the decision rule of SU q over 
carrier k = \, . . . ,N based on the energy detector is 

where yq,k[n] is the received baseband complex signal over carrier Kg = [rg fq\ Tq fq is the number 
of samples, with Tq and fq denoting the sensing time and the sampling frequency, respectively; 'jq^k is the 
decision threshold for the carrier k; Tio^k represents the absence of any primary signal over the subcarrier 
k, whereas Tii^k represents the presence of the primary signaling. 

The performance of the energy detection performed by SU q over carrier k is measured in terms of the 
detection probability Pqk{lq,k,Tq) = Proh {Dqi^ > | ^i,fc} and false alarm probability Pq\{lq.k,Tq) = 
Vxoh{Dq^k > lq,k \ ^o,fc}- Under standard assumptions in decision theory, these probabilities are given by 
[26] 

n!U7...,r,) = Q(v^ ^^'^"^^-^l° ) and P^^, (7,,., r,) = Q ^^^^^^) , (2) 

where Q{x) = (l/\/27r) e~^^^'^dt is the Q-function, and fiq^k\o> ^g.fclii ^q,k\0' ^^'^ '^qM^ constant 
parameters, whose explicit expressions are given in |26| . The detection probability P^j^ can also be 
rewritten as a function of the false alarm rate P^^l as: 

Pq% {Pq%, rq) = Q (^Q-^ {P^%) - ^q ff^^^^Zi!^) A ^ _ p^SS (^^^ p^fa) (3) 

V '-'g,fc|l '-'g,fc|l / 

where we also introduced the definition of the missed detection probability P^i^^{Tq, Pg%) — 1 — Pf^^.(rg, P^^' 



q,k yq^^ q,kl ~ ^ q,k\' q,k) 

The interpretation of P^^ {"fq,k, Tq) and P^j^ {'Jq,k, Tq) within the CR scenario is the following: 1 — P^^ 
signifies the probability of successfully identifying from the SU q a spectral hole over carrier k, whereas the 
missed detection probability P^^^ represents the probability of SU q failing to detect the presence of the 
PUs on the subchannel k and thus generating interference against the PUs. The free variables to optimize 
are the detection thresholds 7g.fc's and the sensing times Tq's; ideally, we would like to choose 7g.fc's and 
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Tq's in order to minimize both P^^. and Pj^^^J , but ([3]) shows that there exists a trade-off between these 
two quantities that wih affect both primary and secondary performance. It turns out that, 7g,fc's and r^'s 
can not be chosen by focusing only on the detection problem (as in classical decision theory), but the 
optimal choice of ^q^k and Tq must be the result of a joint optimization of the sensing and transmission 
strategies over the two phases; such an optimization is introduced in Sec. [3l 

Robust sensing model. The proposed sensing model can be generalized in several directions; see 
|38| I26|. For instance, one can explicitly take into account device- level uncertainties (e.g., uncertainty in 
the power spectral density of the PUs' signals and thermal noise) as well as system level uncertainties 
(e.g., the current number of active PUs) by modeling the detection process of the primary signals as a 
composite hypothesis testing. This leads to a uniformly most-powerful detector scheme that is robust 
against device-level and system-level uncertainties; detailed can be found in |38| I26| and are omitted here. 
It is important however to remark that the resulting detection probability and false alarm rate of the 
aforementioned robust scheme are still given by ([2|) and ([3]), but with a different expression for /ig^fc|j's 
and cr^^|.'s |38) . This means that analysis and results developed in the next sections are valid also for this 
more general model. 

2.2 The transmission phase 

The transmission strategy of each SU q is the power allocation vector Pg = {pq,k}k=i o'^^r the subcar- 
riers, subject to the following (local) transmit power constraints 

Vq^f^Pq^{Pq,k)tl^^'' ■■T.P'^,k<P„ < Pg < p^^ | , (4) 

where p™*^^ = (p™^^)^-^ denotes possibly spectral mask [the vector inequality in is component-wise]. 

According to the opportunistic transmission paradigm, each subcarrier k is available for the transmis- 
sion of SU q if no primary signal is detected over that frequency band, which happens with probability 
1 — Pq%- This motivates the use of the aggregate opportunistic throughput as a measure of the spectrum 
efficiency of each SU q. Given the power allocation profile p = (pg)^^ of the SUs, the target false alarm 
rate P^^ (assumed to be equal over the whole licensed spectrum), the sensing time Tq, and taking the log 
of the opportunistic throughput, the payoff function of each SU q is then (see |26| for more details) 



Rq {Tq, p, P/^) = log - ^) (1 - P;-) f; rg,, (p)^ 



(5) 



where 1 — Tq/Tq, with Tq < Tq, is the portion of the frame duration Tg available for opportunistic trans- 
missions and Tg fc(p) is the maximum information rate achievable on link q over carrier k when no primary 
signal is detected and the power allocation profile of the SUs is pi^k, ■ ■ ■ ,PQ,k' 

rq fc(p) = log I 1 + ^^4 1 , (6) 

with Hqr{k) = Hqr{k)/ Hqq{k) and (Tgfc — cr"^ i^/\Hqq{k)\'^ , where {Hqq{k)}^^^ is the channel transfer 
function of the direct link q and {Hqr{k)}^^i is the cross-channel transfer function between the secondary 
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transmitter r and the secondary receiver q; and o"^ ^ is the power spectral density (PSD) of the background 
noise over carrier k at the receiver q (assumed to be Gaussian zero- mean distributed). 

As a final remark note that the throughput defined in ([5]) is not the average throughput experienced by 
the SUs, which instead would include an additional rate contribution resulting from the erroneous decision 
of the SUs to transmit over the licensed spectrum still occupied by the PUs. We have not included this 
contribution in the objective functions of the SUs because in maximizing the function we do not want 
to "incentivize" the undue usage of the licensed spectrum. Moreover, differently from the opportunistic 
throughput in ([5]), the maximization of the average throughput would require the knowledge from the SUs 
of the a-priori probabilities of the PUs' spectrum occupancy, which is in general not available. 

2.3 Probabilistic interference constraints 

Due to the inherent trade-off between P^^^ and Pg'^t^Pff) [see ^ and maximizing the aggregate 
opportunistic throughput ([5]) of SUs will result in low P^^ and thus large P^j^^^, hence causing harmful 
interference to PUs. To allow the SUs' transmissions while preserving the QoS of the PUs, we envisage 
the use of probabilistic interference constraints that limit the interference generated by the SUs whenever 
they misdetect the presence of a PU. Examples of these constraints are the following: 

- Individual overall bandwidth interference constraint: for each SU q, 

N 

J2 {r„ Pt) ■ ^,,k ■ P,,k < ir^"^ (7) 

k=l 

- Global overall bandwidth interference constraints: 

Q 

9=1 fcG/Cp 

where I^^^ [or /™^^] are the maximum average interference allowed to be generated by the SU q [or all the 
SU's] that is tolerable at the primary receiver; and Wg^s are a given set of positive weights. If an estimate 
of the cross-channel transfer functions {Gp^q{k)}^^^ between the secondary transmitters and the primary 
receiver is available, then the natural choice for Wq^k is Wg^k = \Gp,q{k)\'^ , so that ([7| and ([8]) become 
the average interference experienced at the primary receiver. Methods to obtain the interference limits 
along with some implementation aspects related to this issue and alternative interference constraints are 
discussed in Sec. 15.1.11 

We wish to point out that other interference constraints, like per-carrier interference constraints, as 
well as multiple PUs can be readily accommodated, without affecting the analysis and results that will be 
presented in the forthcoming sections. For notational simplicity, we stay within the above setting. 

3 System Design based on Game Theory 

We focus now on the system design and formulate the joint optimization of the sensing parameters and 
the power allocation of the SUs within the framework of game theory. We consider next two classes of 
equilibrium problems: i) games with individual constraints only (Sec. 13.11 below): and ii) games with 
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individual and global constraints (Sec. 13.11 and Sec. 13.31 below). The former formulation is suitable for 
modeling scenarios where the SUs are selfish users who are not willing to cooperate, whereas the latter 
class of games is applicable to the design of systems where the SUs can exchange limited signaling in favor 
of better performance. Indeed, being less conservative than individual interference constraints, global 
interference constraints are expected to yield better performance of the SUs at the cost of more signaling. 
The aforementioned formulations are thus applicable to complementary CR scenarios. 

3.1 Game with local interference constraints 

In the proposed game, each SU is modeled as a player who aims to maximize his own opportunistic 
throughput Rq (r^, p, Pg^) by choosing jointly a proper power allocation strategy Pq = {pq,k)k=ii sensing 
time Tg, and false alarm rate Pq'^, subject to power and individual probabilistic interference constraints. 
Stated in mathematical terms we have the following formulation. 

Player g's optimization problem is to determine, for given p_g = ((Pr(^)fcLi)^r=i — 0, a 
tuple (tq, Pq, Pq^) in order to 

Rq (r„ p, P/-) 

N 

P^riPt' Tq) ■ Wq,k ■ Pq,k < ir"" ' (9) 

k=l 

Pq'"" < and P.^^rin'"'^?) < VA; = !,••• ,iV, 

p, G Vq and rf^ < Tq < r^^^ 

In ^ we also included additional lower and upper bounds of Tq satisfying < r™"^ < r™^^ < Tq 
and upper bounds on detection and missed detection probabilities < Oq^k ^1/2 and < Pq < 1/2, 
respectively. These bounds provide additional degrees of freedom to limit the probability of interference 
to the PUs as well as to maintain a certain level of opportunistic spectrum utilization from the SUs 
[1 — Pg^ > 1 — Pq]- Note that the constraints aq^k ^ 1/2 and /3q < 1/2 do not represent a real loss of 
generality, because practical CR systems are required to satisfy even stronger constraints on false alarm 
and detection probabilities; for instance, in the WRAN standard, aq^k = Pq,k =0.1. 

3.2 Game with global interference constraints 

We add now global interference constraints to the game theoretical formulation in ([9]). This introduces 
a new challenge: how to enforce global interference constraints in a distributed way? By imposing a 
coupling among the transmissions and the sensing strategies of all the SUs, global interference constraints 
in principle would call for a centralized optimization. To overcome this issue, we introduce a pricing 
mechanism in the game, based on the relaxation of the coupling interference constraints as penalty term 
in the SUs' objective functions, so that the interference generated by all the SUs will depend on these 
prices. Prices are thus addition variables to be optimized (there is one common price associated with any 



maximize 
subject to 

(a) 

(b) 
(c) 
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of the global interference constraints); they must be chosen so that any solution of the game will satisfy 
the global interference constraints, which requires the introduction of additional constraints on the prices, 
in the form of price clearance conditions. Denoting by tt the price variable associated with the global 
interference constraint ([8]), we have the following formulation. 



X LCXyK^L l£ O LIIIIIZjCaLIUII ^lUlLlldll io hyj LlC LL-i ilililC ^ iUi giVCil ^ q \J dilU. /I , f I L Li^itz; 




that TV 




maximize Rq (r^, p, P/^) - vr • V Pf^\Pl'', Tq) ■ Wq^k ■ Pq,k 

^"'^■"■^9 fc=l 




(10) 


subject to constraints (a), (b), (c) as in ([9]). 




Price equilibrium: The price vr obeys the following complementarity condition: 




N Q 




< vr ± - ^ ^ P^riP^, Tq) ■ Wq,k ■ Pq,k > 0. 


(11) 


fc=l q=l 





In ([TT]) . the compact notation < a _L 6 > means a > 0, 6 > 0, and a • 5 = 0. The price clearance 
conditions (llip state that global interference constraints ([8]) must be satisfied together with nonnegative 
price; in addition, they imply that if the global interference constraint holds with strict inequality then 
the price should be zero (no penalty is needed). Thus, at any solution of the game, the optimal price is 
such that the global interference constraint is satisfied. 



3.3 The equi-sensing case 

The decision model proposed in Sec. 12.11 is based on the assumption that the SUs are somehow able to 
distinguish between primary and secondary signaling. This can be naturally accomplished if there is a 
common sensing time (still to optimize) during which all the SUs stay silent while sensing the spectrum. 
However, the formulation (jlOp . in general, leads to different optimal sensing times of the SUs, implying 
that some SU may start transmitting while some others are still in the sensing phase. To overcome this 
issue, several directions have been explored in the companion paper |26| . under the model (jlOp -( fTT]l . Here 
we follow the approach of modifying the formulation in (llOp in order to "force" in a distributed way the 
same optimal sensing time for all the SUs. Roughly speaking, the idea is to perturb the payoff functions of 
the players by a penalty term that discourages the players to deviate from equi-sensing strategies. Stated 
in mathematical terms, we have the following formulation. 

Player g's optimization problem is to determine, for given c > 0, p_q > 0, (r^)^,,^-)^ > and vr > 0, 
a tuple (xq, Pq, Pq^) in order to 

^ c ( I ^ V 

maximize Rq (r^, p, P/^) - vr • ^ Pq^^P^^ ^g) " Wq,k ■ Pq,k " n " h"-? " n 5Z 

""P"-^/'' k=l ^ V ^ r=l J (12) 

subject to constraints (a), (b), (c) as in ([9]). 
Price equilibrium: The price vr obeys the complementarity condition (jlip . 
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The third term in the objective function of each SU in (|12p helps to induce the same optimal sensing 
time for all the SUs. Roughly speaking, one expects that for sufficiently large c, the aforementioned term 
will become the dominant term in the objective functions of the SUs, leading thus to solutions of the game 
having sensing times that differ from their average by any prescribed accuracy. This intuition has been 
made formal in our companion paper |26j for stationary solutions of the game (jl2p , and it can be similarly 
extended to the Nash equilibria; we omit the details because of space limitation. 

3.4 Unified formulation and summary of notation 

In this section, we introduce a compact and unified formulation of the proposed games that simplifies 
their analysis. Let us start by separating the convex constraints in the feasible set of the players from 
the nonconvex ones. The interference constraints (a) in ([9]) are bi-convex and thus not convex, whereas 
constraints (b) are convex in P^^'^ and y/Tq- This motivates the following change of variables: 



'rq^Tg = y%fq q=l,...,Q, (13) 

so that the constraints on P^^!^'^ {P^^ , Tg) in each player's feasible set become convex in the tuple {Pg^,Tq) 
[with P^^ < /3g ]. Indeed, for each k = I, . . . , N , we have 

P^tiPt^ r,) < a,,, ^ ^ Q-i (P/-) - % ^<^Ml-^^,,m < g-i _ ^^^^) ^ ^^^^ 



where Q ^ (•) denotes the inverse of the Q- function [Q(x) is a strictly decreasing function on M], which 

^<?) [provided that P/' 



are convex constraints in [P^^,Tq) [provided that P^^ < /3g ]. Using the above transformation, we can 



equivalently rewrite the missed detection probability P^k^{Pq^^Tq) and the throughput Rq{Tq, p, Pg^) 
of each player q in terms of the tuples (r^, Pq, Pq'^)'s, denoted by P^k^{Pg^,Tq) and Rg{Tg, p, Pg^), 
respectively; the explicit expression of these quantities is: 

nmiss/Dfa _ \ _ Smiss/ofa ^ \ A n ( '^qMO Q ^ {Pq'^) ~ (/^g,fc|l ~ /^9,/c|0 ) ''"g \ 

1f2 \ N \ 



Rq (r„ p, P/^) = Rq (f„ p, P/-) A log 1^ 1^ 1 _ _^ j ^ ( 1 _ p^fa^ ) , (p) j . (16) 

To incorporate the equi-sensing case in our unified formulation, we introduce the functions 6g{xg, x_g), 
which represent the objective functions of the users including the equi-sensing term, with (r, p, P^^) = 
((7^, Pg, Pg^^))'^-^ denoting the strategy profile of all the players: 




Oqir, p, P*^) ^ Rq{?g, p, p;-) - ^ - \ ^ . (17) 



We can now rewrite the feasible set of each player's optimization problem in terms of the new variables 
(rg, Pg, Pg^) , dcuotcd by Xg: for each q = 1, . . . ,Q, let 

Xg ^ { (f„ p„ P,f-) (^yg\Ig (f„ p„ P/^) < 0} (18) 
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where we have separated the convex part and the nonconvex part; the convex part is given by the poly- 
hedron yq corresponding to the constraints (b) and (c) in Q under the transformation (|13|) [cf. (|14p ]: 



with 



(19) 



a,,fc^Q"^(l-a,,fc), f--4^r--/^^ and f--A^^min/^, (20) 

whereas the nonconvex part in (jlSp is given by the constraint (a) that we have rewritten as Ig{Tq, Pq, Pq^) < 
by introducing the local interference violation function 

N 

iq (r„ p„ p/^) ^ ^ P-r (^/^ T,) • t/^,,. • v,,k - /r^. (21) 
it=i 

This measures the violation of the local interference constraint (a) at (rg, pg, Pq^)- Similarly, it is con- 
venient to introduce also the global interference violation function /(r, p, P^*^), which depends on the 
strategy profile (r, p, P^*^) of all the players: 

N Q 

/(r,p,P^^)^5]5^P-r(^^r,)-«;,,fc-P,,fc -I-^^; (22) 

fc=l q=l 

I{t, p, P^'^) measures the violation of the global interference constraint ([8|) at (r, p, P^*^); global inter- 
ference constraints ([8]) can be then rewritten in terms of /(t, p, P^^) as /(r, p, P^^) < 0. 

Based on the above definitions, throughout the paper, we will use the following notation. The convex 
part of the joint strategy set is denoted by 3^ = n^i-^g' whereas the set containing all the (convex 
part of) players' strategy sets except the q-th one is denoted by 3^_g = JlrT^g'^'-' similarly, we define 
X = ^^'^ '^-q — Ylr^q ^r- For notational simplicity, when it is needed, we will use interchangeably 

either (t^, pq, Pq^) or Xg = (tq, pg, Pq^) to denote the strategy tuple of player q; similarly, the strategy 
profile of all the players will be denoted either by x = (xg)^^-,^ or (r, p, P^'^), with r = {Tq)^^^, p = 
(Pg)^i) S'lid P^*^ = (Pg*^)^^]^, whereas x_q = {ii.r)'^^r=i strategy profile of all the players except 

the g-th one. All the tuples above are intended to be column vectors; for instance, (r, p, pf^) si gnifies 
(?, p, P^^) = [t^, p^, P^^"]^, with r 4 {Tq)%, = [n, . . . ,fQ]^, p ^ {vq)%, = [pf , . . . , p^]^, whcrc 
each pg = {pq,k)k=i = \PqA^---^Pq,NV^ and P^^^ = (Pg^)^=i = [P/'', . . . , P,^^]^. For future convenience, 
Table [1] collects the above definitions and symbols. Using the above notation, the games introduced in the 
previous sections can be unified under the following reformulation. 



Players' optimization. The optimization problem of player q is: 

maximize Gq(xq, x_„) — vr • I(x) 

(23) 

subject to Xq = {Tq, Pq, P^"^) G Xq. 

Price equilibrium. The price obeys the following complementarity condition: 

< TT ± -/(x) > 0. (24) 
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Throughout the paper, we wih refer to the game (|23p along with the side constraint (j24p as game 
g{X, e), where 6 ^ (0,(xg, x.^, 7r))^^i. 

Table 1: Glossary of notation of game g{X, 6) [cf. (f23l) -(l2ll] 



Symbol 


Meaning 




sensing time of SU q 


Vq - [Pq,k)k=l 


power allocation vector of SU q 


TT 


scalar price variable 


pfa 

q 


false alarm probability of SU q 


pmiss 


missed detection probability of SU q on carrier k [cf. (|3]l] 


^ — /t f 
Tq — \JTq]q 


normalized sensing time of SU q [cf. (I13p] 


Xf/ = {jqi Vqi Pq ) 


strategy tuple of SU q 


X— g = (tV, Pn Pr }rytq 


strategy profile of all the SUs except the q-th. one 


X ^ (x,),li = (r, p,pf-) 


strategy profile of all the SUs 


^qi^qj X_q) 


payoff function of SU q including the equisensing penalization [cf. (|17p[ 


^g(xg) 


local interference constraint violation of SU q [cf. (j2ip] 


/(x) 


global interference constraint violation of SU q [cf. (f22|)] 


^g. ^ - llq=i 


feasible set of SU q [cf. p8|)]. joint feasible strategy set of g{X, 6) 




joint strategy set of the SUs except the q-th one 




convex part of Afg [cf. (119p]. Cartesian product of all J'g's 



Needless to say, when vr = and c = 0, g{X, 0) reduces to the game in ^ where there are only 
individual interference constraints ([7]), whereas when c = 0, Q{X , 0) coincides with the game in (jlOp -(|li p 
with local and global interference constraints. 

As a final remark, we observe that the proposed formulations may be extended to cover more general 
settings, without affecting the validity of the results we are going to present. For instance, the case of 
multiple active PUs and additional local/global interference constraints (such as per-carrier constraints) 
can be readily accommodated: Instead of having a single price variable, we associate a different price 
to each global interference constraint and proceed similarly as in (I23p - p4l) . Also, the sensing model 
introduced in Sec. 12.11 can be generalized to the case of multiple active PUs, and the presence of device- 
level uncertainties (e.g., uncertainty in the power spectral density of the PUs' signals and thermal noise) as 
well as system level uncertainties (e.g., lack of knowledge of the number of active PUs). The mathematical 
details of these more general formulations can be found in our companion paper [26]; for notational 
simplicity, here we will stay within the formulation (j23p -(|24 p . without loss of generality. 

4 Solution Analysis: Nash Equilibria 

This section is devoted to the solution analysis of the games introduced in the previous section. In order 
to provide a unified analysis, we focus on the general game Q{X, 0) with side constraints; results for the 
other proposed formulations are obtained as special cases. We start our analysis by studying the feasibility 
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of each optimization problem in (p3|) (cf. Sec. 14. ip : we then extend the definitions of NE to a game with 
side constraints and estabhsh its main properties (cf. Sec. 14. 2p . 



4.1 Feasibility conditions 

Introducing the SNR detection snr^^ = a'j f,/'^qk experimented by SU q over carrier k and using the 
definitions given in Sec. 12.11 sufficient conditions guaranteeing the existence of an optimal solution for 
each player's optimization problem (j23p are the following: For all q = 1, . . . ,Q and k = 1, . . . ,N, there 
must exist a common sensing time r (corresponding to normalized sensing times Tq = \/t fq) such that 

2;<v7<5:. vv> e-'(V) + ie-K>)l Km.K..)^ 

The first set of conditions in (j25p simply postulates the existence of an overlap among the (normalized) 
sensing time intervals t^^^/ \/7q] hi (I23p . which is necessary to guarantee the existence of a 

common value for the sensing times in the original variables r^'s. The second set of conditions guarantees 
that the strategy sets 3^q's (and thus Xq's) are not empty. Interestingly, they quantify the existing trade-off 
between the sensing time (the product "time-bandwidth" fqT of the system) and detection accuracy: the 
smaller both false alarm and missed detection probability values, the larger the sensing time (the decision 
process must be more accurate). 

When the sensing times are not forced to be the same, as in the formulations ([9]) and (fTO|) -( fTT]l . the 
feasibility conditions (|25p can be weakened by the following: For all g = 1, . . . , Q and k = 1, . . . , N , 

fq'^> + KfcliKfcio) ^ ^26) 

Throughout the paper, we tacitly assume that each user's optimization problem under consideration has 
a nonempty strategy set (the associated feasibility conditions above are satisfied). 

4.2 Existence and uniqueness of the NE 

We focus in this section on the NE of G{X, 0). The definition of NE for a game with price equilibrium 
conditions such as Q{X ^ 0) is the natural generalization of the same concept introduced for classical 



noncooperative games having no side constraints (see, 


e.g., |27j) and is given next. 




Definition. A Nash equilibrium of the game Q{X, 


6) is a strategy-price tuple (x*, vr) 


, such that 


X* G argmax {6'q(xq,x*g) - vr* 

Xq e Xq 


/(xg,x*g)}, Vg = l,...,Q, 


(27) 


and 






< vr* _L - 


/(x*) > 0. 


(28) 


A NE is said to be trivial if the power-component p* 


= for all g = 1, . . . , Q. 


□ 



In words, the proposed notion of equilibrium is a stable state of the network consisting of an equilibrium 
power /sensing profile x* and price vr*: at (x*, vr*), the SUs have no incentive to change their power /sensing 
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profiles x* based on the current state of the network [represented by (j27p ]. while the optimal value vr* of 
the price is such that all global interference constraints are met [a situation represented by (|28p ]. Note 
that, for a set of fixed price vr*, the equilibrium power /sensing profile x* can be interpreted as the NE of 
a classical noncooperative game (having thus only local constraints), wherein the payoff function of each 
player q is Oq{», x_g, vr*) and the strategy set is Xq. The proposed equilibrium concept is thus a NE of the 
aforementioned game with an appropriately selected price. 

The game Q{X, 6) is nonconvex with the nonconvexity occurring in the players' objective functions 
and the local/global interference constraints; moreover, the feasible price [satisfying (j28p ] is not explicitly 
bounded [note that this price cannot be normalized due to the lack of homogeneity in the players' opti- 
mization problem (f23|) ]. Because of that, the existence of a NE is in jeopardy. The rest of this section is 
then devoted to provide a detailed solution analysis of the game; we derive sufficient conditions for the 
existence and the uniqueness of a NE. 

Mathematically, a NE can be interpreted as a fixed-point of the players' best-response map. When 
this map is a continuous single valued function, the existence of a fixed-point can be proved by using 
the renowned Brouwer fixed-point theore nfl (see, e.g., |35| Th. 2.1.18]), provided that one can identify 
a convex compact set for the application of the theorem. Our goal is then to derive a set of sufficient 
conditions under which the best-response map associated with G{X, 6) is a single- valued continuous map 
over a proper compact and convex set; this is a nontrivial task, because of the nonconvexity of the players' 
optimization problems and the potential unboundedness of the price. The new line of analysis we propose 
is based on the following three steps: 

Step 1 : To deal with the unboundedness of the price, we introduce an auxiliary price-truncated game 
Qt{X, 0), where the price vr is constrained to be upper bounded by a given positive constant t; 

Step 2 : We derive sufficient conditions for the nonconvex players' optimization problems in the game 
Qt{X, 6) to have unique optimal solutions; building on such solutions we introduce a continuous 
single- value map— the best-response associated with the game Qt{X ^ 0)— defined on a convex and 
compact set, whose fixed-points are the Nash equilibria of the game Qt{^, We can then apply 
the Brouwer fixed-point theorem to deduce that Qt{'^ ■, has a NE; 

Step 3 : The final step is to demonstrate that there exists a sufficiently large t such that the price trun- 
cation in the game Qt{X, 0) is not binding. This will allow us to deduce that a NE of Qt{X, 0) is 
also a NE of the original, un-truncated, game Q{X, 0). 

Step 1: The price-truncated game Qt{X, 6) 

To motivate the price-truncated game, observe first that the price complementarity condition in (|28p is 
equivalent to 

vr* G argmax {vr • /(x*)} . (29) 

7r>0 

^Brouwer fixed-point theorem states that every continuous (vector- valued) function <& : C C defined over a nonempty 
convex compact set C C R" has a fixed point in C. 
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In order to bound the price vr in ([29|) . let us introduce the price interval defined as: given t > 0, 

St = {tt \ 0<TT <t} , (30) 

and truncate in (p9|) the nonnegative axis vr > by iSf. We then replace (|29p with the following price- 
truncated optimization problem: 

TT* G argmax {nt ■ /(x*)} , (31) 

where instead of vr we used vr^ to make explicit the dependence of the optimal solution of (j3ip on t. Using 
(j3ip . the price-truncated game Qt{X, 0) can be defined as follows. 



Game Gt{^, ^)- The game is composed of Q+1 players' optimization problems: the following nonconvex 
optimization problems for the Q players 



maximize Og (xg, x_g) — vr^ • /(x), q = 1, . . . ,Q, 



and the price-truncated optimization problem for the (Q + l)-st player 



maximize vr^ • /(x) 



(32) 



(33) 



Note that in the game ^j(Af, 0) there are no side constraints, but the price complementarity condition 
in ()28p is treated as an additional player of the game, at the same level of the other Q players. In fact, 
this formulation facilitates the solution analysis of the game, as detailed next. 

Let us start our analysis by rewriting the NE of Qt{X, 6) as fixed-points of a proper best-response 
map defined on a convex and compact set, which allows us to apply standard fixed-point arguments. 
Given t > 0, suppose that each optimization problem in ()32p has a unique optimal solution for every fixed 
x_q G y-q and vr^ G St (we derive shortly conditions for this assumption to hold; see Proposition [2] below); 
let denote such a solution by x*(x_g, vr^), i.e., 

x*(x_q. Tit) = argmax {Oq {Zq, x_q) - vri • I(Zq, X_g)} , (34) 

where in (|34p we made explicit the dependence of x*(x_g, vr^) on the strategy profile x_g of the other 
players and the price vr^. In order to have a unique solution also of the price-truncated linear optimization 
problem (|33p . we introduce the following proximal-based regularization in (p3|) : given t > 0, x G 3^, and 

TTt e St, let 

vr*(x, vrt) = argmax <^ fit ■ /(x) - - (^t - iTtf \ ■ (35) 

Note that, thanks to the proximal regularization, the optimization problem in (I35p becomes strongly 
convex for any given (x, vr^), and thus has a unique solution vr*(x, nt), which depends on (x, -Kt). Building 
on (|34p and (|35p . we can introduce the following best-response map B : y x St ^ y x St associated with 
the price-truncated game Qt{X, 0): 

/ xi \ / x^(x_i, -Kt) \ 



y xStB (x, vrt) 



^ S(x, vrt) 



XQ(x_g, vrtj 



(36) 
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Note that, even though the feasible sets Xq of the players' optimization problems in (j32p are nonconvex, 
the map B{») is defined over the convex and compact set 3^ x St; which is a key point to apply the Brouwer 
fixed-point theorem. Moreover, the set of fixed-points of B{») coincides with that of the NE of the game 
Qt(X, 0), establishing thus the desired connection between the map (|36p and the game Gti^, d). More 
formally, we have the following. 

Lemma 1. Suppose that each optimization problem in has a unique optimal solution for every given 
X-q € y^q and tti & St- A tuple (x*, vr*) is a NE of Qt{X, 6) if and only if it is a fixed-point of the map 
B{»); that IS (x^ <) = B (x^ <). 

Based on Lemma [H we can now study the existence of a NE of Qt{X , 6) by focusing on the fixed-points 
of the map B. 

Step 2: Existence of a NE of gt{X , 0) 

We provide now sufficient conditions guaranteeing that each nonconvex problem ([32|) has a unique optimal 
solution, for every given x_g G y-q and TTt £ St- Then, we show that these conditions are also sufficient 
for the existence of a fixed-point of the map B in (j36p . and thus a NE of the game (Af, 6). 

It is well-known that, under some Constraint Qualification (CQ), a locally /globally optimal solution of 
a (possibly nonconvex) nonlinear program satisfies the Karush-Kuhn- Tucker (KKT) conditions associated 
with the optimization problem; such solutions are called stationary solutions of the optimization problem. 
It turns out that to establish the single- valuedness of the players' best-response map it is enough to derive 
conditions guaranteeing the uniqueness of the stationary solutions, provided that a suitable CQ holds. The 
classical approach to write the KKT conditions of each player's optimization problem would be introducing 
multipliers associated with all the constraints in the set A'g— both the convex part and the nonconvex 
part /g(xg) < [cf. (jlSp j— and then maximizing the resulting Lagrangian function over the whole space 
(i.e., considering an unconstrained optimization problem for the Lagrangian maximization). The study 
of the uniqueness of the stationary solutions based on the "standard" KKT conditions is however not an 
easy task. To simplify the analysis, we propose here a different approach: instead of explicitly accounting 
all the multipliers as variables of the KKT system, for each player's optimization problem, we introduce 
multipliers only for the nonconvex constraints Iq{xq) < 0, and retain the convex part as explicit 
constraints in the maximization of the resulting Lagrangian function. More specifically, denoting by Xq 
the multiplier associated with the nonconvex constraint Iq{xq) < of player q, the Lagrangian function 
associated with the optimization problem (j32p of player q (rewritten as a minimization) is 

Cq{{Xq,Xq) , X_g, TTj) = -6'g(Xg,X„g) + Xg ■ /q(Xg) + TTt ■ /(Xg,X_q), (37) 

which depends also on the strategies x_g of the other players and the price vrj. Given x_g and TTt, it is not 
difficult to see that if x* is an optimal solution of the q-th player's optimization problem in (j23p and some 
CQ holds at x*, there exists a multiplier A* associated with the local nonconvex constraint Iq{xq) < 
such that the tuple (x*, Ag) satisfies 

(i) : x* € argmin{£g((xg, Ag),x_g,7rt)} 

(38) 

(ii) : < A^ ± -l,(x*)>0. 
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Note that each Lagrangian minimization in (i) is constrained over the convex part of the player's local 
constraints Xq. Since J^g is a convex set, we can invoke the variational principle for the optimality of x* 
in (i), and obtain the following necessary conditions for ()38p to hold: 



(i') : {^g-^*f Vx,A((x^,A*), x_„7rt) > Vx^ G yg 
(ii'): (A^_A*).(-/,(x*)) >0, VA,G]R+ 



(39) 



where (i') is just the aforementioned first-order (necessary) optimality condition of the (nonconvex) opti- 
mization problem in (i), albeit with a convex feasible set 3^^; and (ii ) is equivalent to (ii). Finally, since 
there is no coupling in the constraints involving the variables Xg and Ag in (i )-(ii ), we can equivalently 
rewrite the two separated inequalities (i )-(ii ) as one inequality, obtaining 



x„ 



x; 



Ao — An 



Vx,/:,((x*,A^), x_„7rt) 



>o, y{xg,Xg)eyqX 

— /V-fV 



(40) 



The above system of inequalities defines the so-called VI problem in the variables (xq,Aq) for fixed 
(x„g, TTt), whose defining vector function is Fq (•; x_g, irt) and feasible set is JCq, both defined in (|40p PI such 
a VI is denoted by VI(/Cg,Fg). According to the implications (|38p =^()40 p . the VI(/Cq,Fq) is an equivalent 
reformulation of the KKT conditions of the q-th player's optimization problem in (I23p . wherein the convex 
constraints J'g's (and thus the associated multipliers) have been absorbed in the VI set ICq, which is thus 
convex. It turns out that the nonconvex problem in (j23p has a unique optimal solution for any given x_g 
and TTt— the best-response of (j36p is unique, and thus x*(x_q, nt) is well-defined— if the VI(/Cq,Fq) has a 
unique Xg-component solution and some CQ holds. Proposition[2]below shows that Abadie CQ |35l Ch. 3.2] 
is satisfied by any nontrivial optimal solution of ()23p and establishes the uniqueness of the Xg-component 



under the positive definiteness of the Hessian matrix V'^^Cq ((xg, Ag), x_g, tti) of £g((xg, Xq) , x_q, vr^), for 
all (xg, Xq) G JCq and any given x_g G y^q and vtj > 0. The matrix V'^^Cq ((x^, A^), x_g, vr^) [interpreted 
as a function of (xg, Xq), for fixed x_g and vr^] is given by 



Vx„'Cg((x„,AJ, X^q,TTt) 



V2 



Xq, X„g) + Xq ■ /„(X„) + TTt ■ /(Xg,X_„). 



(41) 



Lemma [12] in Appendix [A] shows that all the Ag-solutions of the VI(/Cq,Fq) are bounded from above, 
for every given x_q G and vrt G 5^. Specifically, it holds that any A* satisfies A* G [0, A'^^'^] (see Lemma 
[12] in Appendix [A]), with 



^max A ^ ^ 

q=l 



1/ 



min <; C'^^ min p^l^ 



mm < 

l<k<N 



log 



1 + 



^max 



(42) 



mm 

l<fc<Af 



^Given a set Q C R" and a vector-valued function ^' : Q — i- R", the VI(Q, ^) problem is to find a point z* G Q, termed 
a solution of the VI, such that (z - z*)'^*(z*) > for aU z £ Q [35] . 
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This allows us to restrict the requirement on the positive definiteness of V^^Cq ((xg, Ag), x_g,7rj) on all 
G yq and Xq £ [0, A™'^^]. The above discussion is made formal in the following proposition. 



Proposition 2. Let x_g G y^q and vtj G St for some t > 0. Suppose that V^^Cq {{xq, Xq), x_g,7rj 

is positive definite for all Xg G and Xq G [0, A™'^^] . Then, the q-th nonconvex optimization problem 
in has a unique optimal solution x* G Xq that is necessarily nontrivial. 

Proof. See Appendix Rl □ 

Note that under conditions in the above proposition, the optimization problems in (|32p remain non- 
convex (the constraint set Xq is indeed nonconvex). To shed light on the physical interpretation of the 
obtained result, we provide in Corollary [3] below easier conditions to be checked (but more restrictive) 
under which Proposition [2] is true. To state the corollary, we use as weights Wq^s involved in the interfer- 
ence constraints ([7]) and ([5]) the cross-channels between secondary and primary users, i.e., tt;^ ^ = Gp^q{k), 
for all g = 1, . . . , Q and k = 1, . . . ,Q (more general conditions are given in Appendix [A]) . 

Corollary 3. Proposition\^ holds if the following sufficient condition is satisfied: 

,(.). ^„ |IW|<1. (43) 

where is a positive constant that depends only on system/ sensing parameters and it is defined in IllOS^) 
(cf. Appendix W\) 

Proof. See Appendix [Bl □ 

The condition in (j43p has an interesting physical interpretation: the nonconvex problem in ()32p has 
a unique solution provided that the (normalized) cross-channels between the secondary and the primary 
users are "sufficiently" small, meaning that there is not "too much" interference at the primary receivers; 
see Sec. 14.31 for more details on the physical interpretation of the above conditions. 

Based on Proposition [2] and Lemma[Tl we can now establish the existence of a NE of the game Qt{X, 0) 
invoking the existence of a fixed-point of the single- valued mapping B{») defined in (|36p . 

Proposition 4. Given t > 0, suppose that each matrix V'^^Cq ((xg, Xq), x^q,TTt) in [^\ ) is positive definite 
for all {xq,Xq) G X [0, A™^^], x_g G 3^q, and tti G St. Then, the game Qt{X, 9) has a (nontrivial) NE. 

Proof. Under the positive definiteness of each matrix V^^Cq ((xg, Xq), x_g, VTf ), the optimization problems 
(I34p and (|35|) have a unique optimal solutions Xg(x_g, 7r()'s and vr^(x, vrj), respectively, for any given 
X £ y and TTt G St. Since these optimal solutions are unique, it is not difficult to show that they are 
continuous functions of the parameters (x, lit) (see, e.g., |39|). implying that the single-valued map B 
in ([36|) is a continuous function on the convex and compact set y x St- It follows from the Brouwer 
fixed-point theorem, that B has a fixed-point, which is a NE of the game Gt{X, 6) (Lemma [TJ. It follows 
from Proposition [2] that such a NE must be nontrivial. □ 
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Step 3: Existence and uniqueness of a NE of the game G{^, 0) 



To pass from a NE of the price-truncated game Qt{X, 0) to a NE of the original game Q{X , 6), we argue 
that there exists a sufficiently large t > such that the truncation constraint vr^ < t in St is not binding 
at the optimal solution vr* of the price-truncated optimization problem (j35p . corresponding to a NE of 
Qt{X, 0). This implies that a NE of Gt{X, ^) is also a NE of Q{X, 9) and, as such, existence conditions 
given in Proposition [5] for the game Qt{X, 6) apply also to Q{X, 6). This is made formal in Theorem [5] 
below, where we derive sufficient conditions for the existence and uniqueness of a NE of Q{X, 6). 

To introduce the theorem, we follow a similar approach as in Step 2: i) we first write the KKT 
conditions associated with the game Qt{X, 9), which under some CQ, are necessary conditions for a tuple 
(x*, vr*) to be a NE of Qt{X , 0) along with some multipliers associated with the local nonconvex constraints 
{/g(xg) < 0, g = 1, . . . , Q} and the truncation in St] and then ii) we rewrite this KKT system as a proper 
VI problem, whose solution analysis leads to the desired results (c.f. Theorem [5|). 

Under a suitable CQ, every NE (x*,7r*) of Qt{X , 6) will satisfy the KKT conditions of the game, 
which are obtained by aggregating the KKT conditions of players' optimization problems in (p2|) and (j33p . 
Denoting by A* and 77^ the multipliers associated with the nonconvex constraint /g(x*) < of player q 
and the price truncation tt* <t m St, respectively, and proceeding as in (f38|) -()^0|). the KKT conditions of 
Qt{X, 0) that are necessarily satisfied by any NE (x*,7r*) can be written as: 



(i): 



V XQ 



XT 



X? 



j \ Vxq£q((x^,A^),x*q,0 j 



> 0, Vxg G yq and q = 1, . . . ,Q, 



I (Ai-At) \ 



i -A(xt) \ 



(44) 



>0, VA„ > and q = \,...,Q 



(iii) : < < _L -/(x*) ^ rft > and < r?^* _L t - < > 0. 

Observing that the complementarity conditions in (iii) of (l44l) are equivalent to the VI problem in the ixt 
variable: 

(vr^ - • (-/(x*)) > 0, V^tG5t, 
the KKT system (j44p can be equivalently rewritten as 



/ x-x* \ 
A - A* 



T 



I (V.,A((x^,A^),x%,<))^^^ \ 



V 



-/(x'^) 



> 0, V(x, A, vrt) G 3^ X X St 



J 



(45) 



^*(x*,A*,7r*) 

which represents a VI problem in the tuple (x, A, vr^), i.e., W{Zt, with x = (xg)?^-,^ and A = (Ag)?^^. 
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Based on the VI formulation (|45p . in Appendix [Cl we prove that the fohowing two properties are 
satisfied by any solutions (x*, A*, vr*) of Yl(Zt, ^) and thus by any NE of Qt{X, 6) (under some suitable 
CQ): i) at any (x*, A*, vr*), vr* is bounded from above by vr^ < A™'''', with A™'^'' defined in (glD; and ii) 
the x-component of (x*, A*, vr*) is unique if the Jacobian matrix of (Vx,>Cg((xg, Ag), x_q,7r4))^-^ with 
respect to x, denoted by A(x, A, vTf), is positive definite on y x [0, A™'*^]'^ x 5t, with A(x, A, iTt) given 
by: 

/ Vxi£i((xi, Ai), x_i,7rt) \ 



A(x, A, Tit) = J 



(46) 



\ Vxg/:Q((xQ, Aq), x_Q,7rt) I 

Building on the established connection between the NE of Qt{X, 6) and the solutions of the VI(^(, ^) 
and using properties i) and ii) above, we can finally obtain the desired existence and uniqueness result: 

(a) It follows from property i) that since the truncated game Qt{X , 0) has a NE for t > A™^^ (which is 
guaranteed under conditions in Proposition the original game Q{X, 6) must have a NE as well; and 

(b) According to property ii), if there exists a t > A™^^ such that A(x, A, Tit) is positive definite for 
all (x, A, TTt) G 3^ X X St, the x-component of the solution of the \l[Zt, 'J')— and thus of the NE of 
Q{X, 0)—\s unique. These results are collected in Theorem [5] below and formally proved in Appendix [Cl 

Theorem 5. Given the game G{X, 6) and A™^^ defined in [JW, the following hold: 

(a) Suppose that there exists at > X"^^^ such that each matrix V'^^Cq ((xg, Ag), x_g, vr^) in ( [^ j[ ) is positive 

definite for all (xg, Ag) G % x [0, A"^^^], x_g G 3^g, and TTt e St- Then, every NE (x*, vr^*) ofGt{X, 6) 
is a NE ofg{X, 0); therefore g{X, 6) has a NE; 

(b) // the condition in (a) is strengthened by the following: the matrix A(x, A, vr^) in is positive 

definite for all :x. y, AG [0, A™*^]*^, and vr^ G St, then the 'x.-component of the NE of the game 
Q{X, 6) is unique. 

Proof. See Appendix [Cl □ 

Sufficient conditions for the matrix A(x, A, vtj) to be positive definite are given in the following. 

Corollary 6. Statement (b) [and thus also (a)] of Theorem\^ true if the following sufficient conditions 
are satisfied: for all q = 1, . . . ,Q, 

where 7g"^^ and 7g^^ are positive constants depending only on system/ sensing parameters and are defined 
in \102\) and ill9\) . respectively (cf. Appendix\D^. 

Proof. See Appendix [Dl □ 
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4.3 Discussion on the existence/uniqueness conditions 

Corollary [3] and Corollary[6]suggest an intuitive physical interpretation of the equilibrium existence/uniqueness 
conditions: existence of an equilibrium and uniqueness of the x-component are ensured if the MUI in 
the network is sufficiently small (compared to the background noise). More specifically, existence re- 
sults in (|43p impose a limit (only) on the maximum interference that the the SUs are allowed to gen- 
erate at the primary receivers, measured by ^ max^ ||Gpg(/c)p//*°*}. Uniqueness conditions in ()47p 
impose instead a limit on the maximum MUI experienced at both primary and secondary receivers. 

This is clear looking at the LHS of (gT]): the first term on the LHS, max {|Gp<j(/c)p/I*°*}, co- 

k=l,...,N 

incides with that of (j43p . imposing thus a limit on the MUI at the PU, whereas the second term, 

^r=iq ^^■^Ar { l'^'?'' I } + ^r=iq '"i^'^N { I I ^/'^^.^ } ' ovcrah MUI in the Secondary 

network; indeed, the quantity X^r^-^o max < \Hrg{k)\'^ /a"^ > is an estimate of the maximum interference 

' k=i,...,N L ' J 

generated by each SU q against all the other SUs r's, and ^^.j^g max {\Hqr{k)\'^ /a^ f^} can be inter- 
preted as a limit on the maximum MUI tolerable by each secondary receiver q and generated by all the 
other secondary transmitters r's. These two sources of MUI affect the uniqueness through the constants 
7g"^^ and 7^^^ which depend on the fixed sensing/device-level parameters as well as on the SU/PUs' QoS 
requirements (e.g., maximum false alarm rate/minimum detection probability, and maximum sensing time 
constraints) . 

Interestingly, conditions in ()47p are of the same genre as those obtained in the literature to guarantee 
the uniqueness of the NE of convex games modeling the power control problem in ad-hoc networks |18| 
HOl [T9l [20] and CR systems |4H 121) . The main difference is that, because of the nonconvexity of some 
constraints and the joint optimization of sensing and transmission strategies, in (I47p . there is an extra term, 
max UGpq{k)\'^ / \ , limiting the interference generated also against the PUs and the two weights 

^q^^ and 7q^^ capturing the sensing/QoS requirements. 



5 Distributed Algorithms 

This section is devoted to the design of distributed algorithms that solve the proposed class of games and 
the study of their convergence. Before analyzing the most general game Q{X, 6), we focus on solution 
methods for the game where the price vr is a fixed exogenous parameter (and thus there are only local 
interference constraints). The resulting algorithms will be used as a subroutine in an extended iterative 
algorithm solving the more complex game G{X, 6) wherein the prices are endogenous variables to optimize. 

5.1 Game with exogenous price 

When the price vr is an exogenous fixed parameter, game Q{X, 6) reduces to the following game. 
Game Q.,^{X, 6). The optimization problem of player q is: given x_g G X^q and vr > 0, 

maximize 9q (xg, x_q) — vr • /(xg, x_g) q = 1, . . . ,Q. (48) 
We have denoted such a game by GTr{X, 6), making explicit the fact that vr is an exogenous fixed 
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parameter. Note that Gni^, ^) contains as special cases the game with zero price (and thus no global 
interference constraints) as introduced in Sec. 13.11 and the equisensing game with constant price tt (and 
local interference constraints only), which is an instance of the game Q{X, 9) introduced in Sec. 13.21 
Therefore, Algorithms for Q-,^{X, 0) apply also to the aforementioned special cases. 

We are interested in iterative schemes based on the best-response mapping: according to a given 
scheduling (e.g., sequentially, simultaneously, or asynchronously), each SU solves his own optimization 
problem in (I48p . given the strategies of the others. If this procedure converges and some suitable conditions 
are satisfied, it will converge to a NE of the game Qt^{X, 6). The Jacobi version of the proposed class of 
algorithms wherein the users update their strategies simultaneously is formally described in Algorithm [TJ 

Algorithm 1: Jacobi Best-Response-Consensus Algorithm for Qt^{X, 0) 

(5.0) : Choose any feasible x^'^^ G X and set n = 0. 

(5.1) : If x^") satisfies a suitable termination criterion: STOP. 

1 ^ fi'^) 

(5.2) : Run a consensus algorithm to locally compute the average — 

Q V fr 

(S . 3) : f or (/ = 1, . . . , Q, compute 

x("+i) e argmax {o^ (x„ xL"^) - vr • /(x^, x_,) | . (49) 

(S.4) : n ^ n+ 1; go to (S.l). 



In order to relax constraints on the synchronization of the players' updates, totally asynchronous 
schemes (in the sense specified in |42| ) can be considered, where some SUs may update their strategy profile 
more frequently than others and they may even use an outdated measurement of the interference generated 
by the others (we refer to [l2] and |20| for a formal description of asynchronous algorithms). The analysis of 
this general class of algorithms is addressed in Appendix |E1 where we provide sufficient conditions for their 
convergence; see Theorem [16] and Corollary [T71 Since Algorithm [1] is an instance of these asynchronous 
schemes, it converges under the same aforementioned conditions. It is worth remarking that the obtained 
convergence conditions have the same physical interpretation of that given for the existence/uniqueness 
of the NE (cf. Sec. 14. 3p . Roughly speaking, they require "low" interference in the network, meaning 
"small" values of the (normalized) secondary cross-channels \Hq,i.{k)\^ / a"^ as well as secondary-primary 
cross-channels |Gpq(/c)p//*°*. Interestingly, they do not depend on the specific updating scheduling used 
by the users, meaning that the whole class of asynchronous algorithms converges under the same set of 
unified conditions. The main implication of this result is that all the algorithms obtained as special case 
of the asynchronous scheme, such as the sequential (Gauss-Seidel scheme) and the simultaneous (Jacobi 
scheme) best-response algorithms, are robust against missing or outdated updates of the players. 

5.1.1 Discussion on the implementation 

We discuss now some implementation issues related to the proposed algorithms; for notational simplic- 
ity, we will focus only on Algorithm [H but similar conclusions can be drawn also for the asynchronous 
implement at ion . 
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In Step 3 of the algorithm, each user q needs to compute its best-response, knowing the information 
on the strategies of the others = (xr"^)^^^-,^, with each x,. = (?>, p,., Pl^). Given the structure of the 
feasible set Xq [specifically, the presence of local interference constraints ([7])] and the functional dependence 
of the objective function in ()48p on x_q [see (j23p ]. this knowledge requires each SU q to estimate: i) the 
overall Power Spectral Density (PSD) of the MUI at each subcarrier, X^^-^^ |-f^(jr(^)PPr,fc; h) the primary- 
secondary cross-channel function {Gpq{k))^^^ [if the weights Wg^^s in the local interference constraints ([7]) 

are chosen as Wq^k = Gpq{k)]; and iii) the average of the (normalized) sensing times (l/Q) Yl?=ii'^r/^/ /i^^) 
of all the SUs. Among other remarks, we discuss next alternative distributed protocols to obtain these 
estimates, each of them being characterized by a different level (albeit limited) of signaling among the 
SUs and computational complexity. 

Estimate of the MUI and the primary-secondary cross-channels 

To measure the MUI in a totally distributed way, it is enough for the SUs to perform a preliminary 
noise calibration of their receivers (during this phase of course the SUs must stay silent). After this 
noise calibration phase, to acquire the MUI, the SUs just need to locally measure the global interference 
experienced at their receivers. Note that this procedure does not require the SUs to be able to distinguish 
between primary and secondary signaling. 

Because of the presence of the individual interference constraints in the set Xq, each SU needs to 
estimate also the secondary- primary cross-channel transfer function {Gpq{k))^^^ [if in ([7]) one uses Wq^^ = 
Gpq{k)]. This knowledge can be acquired by each SU in advance by using classical channel estimation 
techniques, and updated at the rate of the channel coherence time. In the CR scenarios where the PUs 
cannot communicate with the SUs (e.g., when the PUs are legacy systems) and thus cannot be involved 
in the (cross-)channel estimation, and the primary receivers have a fixed geographical location, it may 
be possible to install some monitoring devices close to each primary receiver having the functionality of 
( cross- ) channel/ interference measurement . 

In scenarios where the above options are not feasible and the channel state information cannot be 
acquired, a different choice of the weights coefficients Wq^s and the interference threshold I^^^ in ([7]) can 
be made, based on worst-case channel/interference statistics. More specifically, one can replace the in- 
stantaneous value of the secondary-primary cross-channel transfer function (Gpg(fc))^^ with its expected 
value; the expected value of each Gpq{k) is 

where is a positive constant depending on the number of resolvable paths and their variance; q is the 
path loss exponent, which generally is 2 < <j < 6; dp^ is the distance between the SU q and the PU; and 
do is the Fraunhofer distance. The interference constraints imposed to each SU q become then 

N 

g ■ i + (W ■ - '''' 

which is still in the form of ([7]), with weights coefficients Wq^k = + {dpq/ d^Y). 
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When the distance dpq in (|5ip is unknown, one can instead consider a probabihstic (conservative) 
version of (|5ip . based on the worst-case interference scenario, as proposed in |43| . Modehng dpq as a 
random variable, we can impose 



N 



Prob \^P-r {r,, Pt) ■ 1 ^ ( J^/^^). • P,,k < /r^l > Pi, (52) 

where < P/ < 1 is a given positive constant guaranteeing the desired QoS at the primary receiver. To 
obtain an exphcit expression of the probabihty above, we consider next a more conservative constraint 
implying (j52p . More specifically, denoting by dmin — min^dpg the distance between the PU and the 
nearest SU g, the following interference constraint implies ([52|) : 

Prob {I FX (r„ F/.) • < j > p,. (53) 

Assuming that the SUs are randomly distributed according to a homogeneous Poisson point process with 
spatial density p, dmin — niing dpq is Rayleigh distributed; the probability in (j53p can be then evaluated 
in closed form and we obtain 1431 



which is still in the form of ([7]), where Wq,k = 1 and the interference threshold J™^^ is replaced by 1^ 



rmax 



Estimate of the average sensing time [Step 2] 

The average of the sensing times can be locally computed by each SU by running a consensus based 
algorithm that requires the interaction only between nearby secondary nodes, as stated in Step 2. Con- 
sensus algorithms have become popular over the past few decades since |44) as a practical scheme for the 
in-network distributed calculation of general functions of the node values; several protocols suitable for dif- 
ferent applications and working under different network settings have been proposed and their properties 
analyzed; see, e.g., |45| I46| for a good overview of recent results. In order to minimize the running time of 
the consensus iterates and thus the amount of signaling to be exchange in Step 2 by the SUs, we suggest 
here to use the finite-time distributed convergence linear scheme proposed in |47| . The main advantage 
of this scheme with respect to the more classical consensus/gossip algorithms whose convergence is only 
asymptotic (i.e., exact consensus is not reached in a finite number of times) is that, at no extra signaling, 
each node can immediately calculate the consensus value after observing the evolution of its own value 
over a finite number of time- iterations (specifically, upper bounded by the size of the network). 

The consensus scheme we consider in Step 2 of Algorithm [2] makes use of the following liner iterations: 

^(n) 

given the (normalized) sensing times Tq s obtained as output of Step 3 at iterations n, and setting 

JO) _ ^(n) 



Tq I \Jfq, each SU q updates at each (inner) time- iteration i its value as 



't^^^ — a.qqZ^q^ -|- ttqr \ zf^ '^q) ('^^) 
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where Nq is the set of neighbors of user g, which are the nodes that interfere with node q (the SUs' network 
is modeled as a directed graph); the cardinahty of Mq, the number of neighbors of node is denoted by 
degq" = \N'q\ (also called in the graph theory jargon the in-degree of node g); and the set of 

given coefficients. These weights represent a degree of freedom in the algorithm design; here we focus on 
the following choice that can be made locally by each SU q: 



''qr 



1, 

0, 
F 



degl; 



if r G AAg 
if r = g, 



(56) 



where F is any integer number. Associated with the SUs' network topology, there are some absolute 
quantities that play a role in the stopping criterion of the iterates (j55p and the computation of the final 
consensus value. More specifically, for each node q, there exist a scalar < Lg < Q — deg^ and a {Lq + 1)- 



length vector uig E M^'"'"^ having the following properties |47| : given the samples 
by the SU q in the first Lg + 1 iterations of (j55|) . it holds that 



.(0) 



4^''' collected 



.(0) 



Q ^ 

r=l 



r=l 



(57) 



According to (l57l) . each SU q can obtain locally the desired average of the sensing times after running the 
linear iterates (I55p for Lq + 1 time-steps; this will require at most Q — deg^ + 1 time- iterations. Note that. 



(0) 



to calculate the quantity in ()57p , the SUs do not need to store the entire set of samples Zq , . . . , ^q 
instead one can compute the scalar product in (|57p incrementally, as the iterations progress. 

To implement the above protocol distributively, each SU q has to preliminarily estimate his own Lq 
and lUq] for time-invariant topologies this can be done just once; the cost of this computation will then be 
amortized over the number of times the consensus algorithm is performed. In |47| . the authors proposed 
a decentralized protocol still based on the updating (I55p to perform such a computation in (at most) 
Q{Q — 1) iterations; we refer the interested reader to |47| Sec. V] for details. The consensus protocol 
discussed above is formally described in Algorithm [2] below, which represents the subroutine to implement 
Step 2 of Algorithm [H 

Algorithm 2: Finite-time Consensus Algorithm in Step 2 of Algorithm [1] 

Data : Tg"^ [from Step 2 of Algorithm [T] , Lq, m^, and (a^r)^!, for all q = 1, . . . ,Q. 
Tg"^ , for q = 1, . . . ,Q and set i = 0. 



(S.2a) : Set 4°^ 
(S . 2b) : for i = 1, . . . , maxg Lq, 



— Each SU g updates Zq according to (|55 

- if i =: 
end (for 



Lq for some q, then SU q computes ()57p and gets — , 



1 ^ 



Q 



r=l 



In Algorithm [21 the number of iterations i required to propagate the consensus over the whole network 
is maXg{Lg} + 1 < Q — mingjdegg} + 1. One can reduce such a number by slightly changing the above 
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protocol: SU q runs the iteration (j55p for Lq + 1 consecutive time-steps, or until he receives the consensus 
value from a neighbor. If Lg + 1 iterations passes without receiving the consensus value, SU q calculates 
that value and broadcast it to his neighbors, along with a flag indicating that it is the consensus value 
(and not just an intermediate value). In this way, "slower" SUs r's will receive the final value at most one 
iteration after node q. 

On the time-complexity and communication costs 

We quantify now the complexity of Algorithm [1] (whose Step 2 is implemented using Algorithm [2|) in terms 
of the minimum number of iterations required to reach the desired convergence accuracy and communica- 
tion costs (number of message passing among the SUs). Both results come readily from the following two 
facts. 

Fact 1. The convergence conditions of Algorithm [1] as given in Theorem [16] in Appendix [El are based on 
the contraction properties of the best-response mapping Bt^^{^ = (x*(x_q, T^t))^_^ associated with the 
game Q.,^{X, 6) in ([^8]) . with each x*(x_q. Tit) defined in (j34p : under assumptions in Theorem [T6l there 
exists a constant cg G (0, 1) such that [see (jl39p in Appendix [E] 

||^^,(x)-e,,(y)|| <CB ||x-y||, Vx,yG;f, (58) 

where an explicit expression of the contraction constant cq is given in (|139p (cf. Appendix lE.ip . If the 
"suitable termination criterion" in Step 2 of Algorithm [T] is chosen as the smallest iteration n = rimin at 
which the relative error ||i3,rt(x^"^) — x*|| / Hx^''^ ~ ■'^*|| ^^^^ than a prescribed tolerance Emax > [with 
x* being the NE of g^{X, 0)], <^ leads to 

n..>i^^f(^, (59) 
ln|cB| 

which provides the number of iterations n required for Algorithm [1] to reach convergence (within the 
accuracy Cmax)- 

Fact 2. The consensus algorithm described in Algorithm [2] was shown to converge in at most maXgjLg} + 
1 < Q — minqjdeg^} + 1 iterations. The communication cost incurred by the protocol can be characterized 
as follows. Given the directed graph modeling the network topology (the outgoing edges from each node q 
link the nodes associated with the SUs who receive interference from SU g), each SU q transmits a scalar 
value on each outgoing edge at each time-step i; since there are at most maXg{Lg} runs, each SU q will 
have in principle to transmit (maXg{Lg} + 1) • deg°"* messages, where deg°"* is the out-degree of node 
q (i.e., the number of SUs having user q as interferer). Thanks to the broadcast nature of the wireless 
channel, however, a single transmission of each user q will be equivalent to communicating a message along 
each of degg"* outgoing edges, and thus each node would only have to transmit maXg{Lg} -|- 1 messages. 
Summing over all nodes in the network, there will be X^^i(niaX|j{Lg} -1- 1) overall messages that have to 
be transmitted to run the consensus protocol. 

Using Facts 1 and 2 above, one can conclude that Algorithm [1] (whose Step 2 is implemented by 

Algorithm [2]) converges (within the accuracy Cmax) in ^"^^ • (max„{L„} + 1) (outer plus inner-loop) 

ln\ct3\ 

iterations, which is also the number of per/user message passing. 
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A special case: fixed equi-sensing times 

In the scenarios where no coordination is ahowed among the SUs to run a consensus algorithm, one can 
implement a special case of Algorithm [H where the SUs' sensing times are fixed a-priori and thus not 
optimized. This would correspond to solving the game GjriX, 9) in (|48p where the sensing times Tq are 
fixed and equal to a common value r; the resulting solution scheme will be like Algorithm [1] where there is 
no Step 2 and the optimization problems in (j48p are solved only with respect to the tuple (pg, Pg^), given 
Tq = T. The time and communication complexity of such an algorithm is of the same order of that required 
by the well-known iterative waterfilling algorithm proposed and studied in many papers \18\ [T9l [20l [M] 
to distributively solve the rate maximization game over interference channels (there is no optimization of 
the sensing part in any formulation of that game). The price in the reduction of signaling obtained with 
the fixing of sensing times may be paid in terms of overall performance; in Sec. [6l we numerically quantify 
the loss in using a fixed sensing time rather than optimizing it. This sheds some light on the trade-off 
between performance and signaling in the proposed games. 

On the best-response computation 

A last comment deals with the computation of the best-response of each optimization problem (148p , which 
would require the capability of solving a nonconvex problems. This is not a difficult task under the 
assumption of Theorem [TU] (cf . Appendix [E]) , which ensures that each of such (nonconvex) optimization 
problems has a unique stationary point (cf. Proposition [2|) that can be computed by any of nonlinear 
programming solvers, provided that each SU q has the information on the strategies x_q of the other SUs. 

Finally, observe that, when conditions in Theorem 1 161 are not satisfied, every limit point of the sequence 
generated by the proposed algorithms, wherein the best-response solution is replaced by a stationary 
solution, has still some optimality properties: it is guaranteed to be a QNE of the game, whose properties 
have been studied in our companion paper |26) . 

5.2 Game with endogenous prices 

We focus now on distributed algorithms for solving the general game G{X, 0). The main challenge here is 
to obtain distributed algorithms in the presence of coupling nonconvex constraints. The proposed approach 
is to reduce the solution of the nonconvex game Q{X, 6) with side constraints to a solution of a sequence 
of (compact and) coni^ealf] games of a particular structure with no side constraints. The advantage of 
this method is that we can efficiently solve each of the convex games with convergence guarantee using 
the best-response algorithms introduced in Sec. 15.11 for the game Gni^, d) with exogenous price; the 
disadvantage is that, to recover the solution of the original game Q{X, 0), we have to solve a (possibly 
infinite) number of convex games. However, it is important to remark from the outset that this potential 
drawback is greatly mitigated by the fact that, as we discuss shortly, (i) one only needs to solve these 

•^According to the terminology introduced in |27) . a game is said to be compact and convex if: i) the feasible set of each 
player is a convex and compact set; and ii) the cost function of each player (to be minimized) is a convex and continuously 
differentiable function of the strategy of that player, for any given strategy profile of the other players. The desired properties 
of such games are: i) each player optimization problem is a convex problem and thus it can be solved using efficient numerical 
algorithms; and i) they always have a NE. 
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games inaccurately; (ii) the (inaccurate) solution of the NEPs usually requires little computational effort; 
and (iii) in practice, a fairly accurate solution of the original game G{X, 0) is obtained after the solution 
of a limited number of games in the sequence. 

Before introducing the formal description of the algorithm, let us begin with some informal observations 
and intermediate results motivating how the sequence of convex games is built; the mathematical details 
can be found in Appendix [Fj At the basis of our analysis there are two results, namely: i) an equivalence 
(under some conditions) between the game Q{X^ 6) and the VI(^j, ^) introduced in (145p : and ii) the 
reformulation of the Yl{Zt, ^) as a convex game with no side constraint. The former connection, which is 
made formal in Lemma [7] below, allows us to remove side constraints from the game Q{X 6), whereas the 
latter, given in Lemma [8] below, paves the way to the use of best-response algorithms for convex games 
with no side constraints, as introduced in Sec. 15.11 

Lemma 7. Given the game G{X, 6), suppose that there exists some t > \^^^ such that the matrix 
A(x, A, -Kt) in (4^ is positive definite for all x £ y, \ £ [0, X'^^'^jQ ^ and irt G St- Then Q{X, 6) is 
equivalent to the VI{Zt, which always has a solution. The equivalence is in the following sense: for 
any solution {x^', ttY^) G Zt of the VI, the tuple {x^\ n}"') is a NE ofQ(X, 9); conversely, the game 
G{X, 0) has a NE (x*,7r*), and for any such a NE there exist multipliers A* G [0, A™'^]*^ associated with 
the nonconvex constraints {/^(x*), g = 1, . . . , Q} such that (x*. A*, vr^) is a solution of the VI{Zt, 

Sufficient conditions for A(x, A, vr^) to be positive definite along with their physical interpretation 
are given in Sec. 14.31 (cf. Corollary [6]) . Under conditions of Lemma [71 one can solve the Yl{Zt, ^) and 
obtain the NE of the original game G{X, 0). Since we are interested in using best-response algorithms as 
those developed in Sec. 15.11 for games with exogenous price and no side constraints, we rewrite next the 
W{Zt, ^) as a game, and then use best-response algorithms to solve that game. More formally, let us 
introduce the following game with no side constraints wherein the players, anticipating rivals' strategies, 
solve 

{i) : minimize £g ((x^, Ag), x_q, ttj) , q = l,...,Q 

(ii) : minimize -A^ • /^(xg), q = l,...,Q, (60) 

(Hi) : minimize —irt ■ /(x). 
The following connection holds between the above game and the Yl{Zt, 

Lemma 8. Under the setting of Lemma\% the VI{Zt, ^) is equivalent to the game in \60\). which always 
admits a NE. 

Note that the game in (|6U|) is composed of 2Q + 1 players. The first Q players in (i) correspond to the 
players of the original game G{X, 0)— the SUs in the system— that now optimize a different cost function, 
which is the "Lagrangian" function associated with their original cost functions in (|23p . for a given set 
of price vr^ and multiplies A. In addition to the Q SUs, there are Q + 1 more players solving problems 
(ii) and (iii); they act as virtual players who aim to compute the optimal multipliers Ag's associated with 
the nonconvex local interference constraints {/^(xg), q = 1, . . . , Q} and the optimal price irt, respectively. 
By introducing these virtual players, the original game G{X, 6) can be transformed (under the setting of 
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Lemma [7]) into the desired (compact and) convex game with only local constraints, which paves the way 
to the design of best-response algorithms for the game Gi^, 0)- 

We proved in Appendix [E] that the best-response algorithms introduced in Sec. 15.11 converge mider 
conditions implying the uniqueness of individual player's optimization problems. The game in the form 
(|60p however may never satisfy such conditions; indeed, the linear programming problems in (ii) and (iii) 
have multiple optimal solutions whenever some /g(xg) = or /(x) = 0. To overcome this issue, we follow 
a similar idea as in Step 1 of Sec. I4.2l and introduce in (ii) and (iii) of (I60p a proximal-based regularization 
of the A- variables and price Tr^, so that the resulting modified optimization problems become strongly 
convex. Given the center of the regularization of the A- variables, say A'^ = (A^)^]^, and the price vrj, say 
TT^, and the proximal gain a > 0, the regularized version of the game in (f60l). denoted by Gt{X ,6 ,^:^^) 
is the following. 



Game ^t(A', 0, A*^, vrf ). Anticipating rivals' strategies and \ 


.iven A" 4 {X^g)%, vr?, 


and a > 0, the 


2Q + 1 players solve the following optimization problems: 






minimize Cg {{xg,Xg), x^q,TTt) , 

Xq&yq 


q = l,...,Q 




minimize -Xg ■ Ig{xg) + — (A„ - A°) 


q= 1,...,Q, 


(61) 


minimize -tti ■ /(x) -|- — [iTt - vr^^) 







The main (desired) property of game Gt{X ,tt^ ) is that, under the setting of Lemma [3 the NE 
is unique and it can be computed with convergence guarantee using best-response algorithms as those 
introduced in Sec. 15.11 (we make formal this statement shortly). Nice as it is, this result would be of no 
practical interest if we were not able to connect the solutions of Qt{X , , , ir^) with those of the game in 
(I60p and thus the original game ^(Af, 6). In fact, the solution of ^^(A', 0, A*^, vr^) and (i60l) are in general 
different but, nevertheless, there exists a connection between them, as stated in the following lemma. 

Lemma 9. Under the setting of Lemma\% a tuple (x*, A*, vr*) is a NE of the game in \60\) if and only if 
it is a NE of the game Qt{X , 9, A*, vr*). Therefore, such a (x*, vr*) is a NE of the original game Q{X , 6). 

Providing the relationship between Q(X,9), the game in (1601) . and ^((Af, 0, A°, 7rj°), Lemma El opens 
the way to the design of best-response algorithms that solve the original game Q{X, 0): instead of solving 
Q{X,6) directly, starting from an arbitrary regularization tuple (A'^,7r^) > 0, one can solve the sequence 
of games Qt{X, 9, X^, vr^ ) —)••••—)• Gt{X, 9, A", vr") —)••••, where the center (A", vr") of the regularization 
of the game at stage n is just the (A, vr()-component of the (unique) NE of the game Qt{X, 9, A""^, vr"~^) 
in the previous stage. If this procedure converges, it must converge to a tuple (x*, A*, vr*) that necessarily 
is a NE of the game Qt{X, 6, A*, vr*), which implies by Lemma [9]that (x*, A*) is also a NE of the original 
game Q(X,9). A flow-chart with the connection of all these games along with an informal description of 
the above ideas is given in Figure [TJ 
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(a) 



Game g{X,e) 



(x, 7r) = game variables 
X = players' strategy profile 
TT = price variable associated with 
the side constraint 7(x) < 
(x*,7r*) =NE 



(d) 




Solve the sequence 


of games GtiX , , \" , tt',^) 




n— J-oo 


(x*.~,A*'~,7r;'~) 


= NE of5i(;t,6»,A~,7r~) 



A (x, \,ir) >~0 
[Lemma 7] 

{x*,7r*) = (x-,7r-) 



A (x, A, 7I-) >- 
[Theorem 10] 




(b) 



(x, A, tt) = VI variables 
X = primal variable 
A = dual variables of {Ig{xg) < 0, q = I, . . . , Q} 
TT = dual variable of /(x) < 

{x^',A'",7r^') = VI solution 



(x*'~,A*'°°,7r*'' 



(x*,A*,7r*) 



(c) 



o 

*i a 
< 



(x^",A^\7r") = (x*, A*,7r*) 



Augmented game in (60) 



(x, A, TTj) = game variables 
X = player's strategy profile 
(A,7ri) = virtual players' strategies 

(x*, A*,0 = NE 



Figure 1: Connection among G{X, 6), Vl{Zt, , and the sequence of games Gt{X, 0, A", tt"). Under the setting of Lemma 
[T] we have the following: i) Q[X,9) in (a) is equivalent to the "augmented" VI(-2t,^) in (b), where the local interference 
constraints {Jq(xq), g = 1, . . . , Q} are "relaxed" by introducing the multipliers A = (A,)^^j and tt is a variable of the VI; ii) the 
VI(-2t, ^) can be interpreted as a (compact) convex "augmented" game with no side constraints as represented in (c) [see (|60[l]. 
where there are Q real players, the SUs, and Q+1 virtual players who aim to optimize the multipliers Aq's and the price variable 
TTt; iii) a NE of the augmented game H60p. and thus the original game Q{X,6), is computed via best-response algorithms 
solving the sequence of regularized convex games with no side constraints GtiX ,0, ,Tr°) ^ ■■■ ^ gtiX,e,X°° ,7r!°) as 
shown in (d). 

A formal description of the above solution method is given in Algorithm[3]below, which provides the de- 
sired best-response based scheme solving the game Q(X,6); the convergence conditions are given in Theo- 
remfTOl In the algorithm we use the following notation: given (A", tt"), we denote by (x*(A", vr"), A*(A", tt"), 
7r*(A", vr")) the NE tuple of the game ^t(Af, 6, A", vr"), where we make explicit the dependence on the reg- 
ularization offset (A",7r"). 
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Algorithm 3: Best-Response Algorithm for Q{X,6) 



(S.O) 
(S.l) 
(S.2) 
(S.3) 



Choose any tuple (A'^,7r^) > 0, with A'^ = and some e G (0, 1); set n = 0. 

If (x*(A'',7rf), A*(A",7rj"), <(A'', vrf)) satisfies a suitable termination criterion: STOP. 
Solve the game 0, A", vrf ); let (x'^(A", vrf ), A*(A",7rf), <(A",7rf)) be the NE. 

Update the center of the regularization: 



A^+i ^ (1 



e)-A^ + 6-A*(A",7rr), q = l,... 



Q 



(62) 



.n+1 A 
t — 



(1 



e)-7rr + e-<(A",7rr). 



(S.4) : n ^ n + 1; go to (S.l). 



Theorem 10. Under the setting of Lemma \^ the sequence {(x*(A", vr"), 7r*(A"', tt"))} generated by 
Algorithmic globally converges to a NE ofQ{X,9). 



It is interesting to observe that Algorithm [3] converges under the same conditions introduced in Propo- 
sition [5] and guaranteeing the uniqueness of the x-component of the NE Q{X ,6); we refer to Corollary 
[6] and Sec. 14.31 for easier conditions to be checked as well as a detailed discussion on their interpreta- 
tion in terms of the system parameters. We discuss next some practical implementation issues related to 
Algorithm [3l 

5.2.1 Discussion on the implementation 

Algorithm [3] is conceptually a double- loop scheme wherein at each (outer) iteration n, given the current 
values of the regularization parameters (A", vr"), the SUs solve the game ^^(A', 0, A", vr") (with t > A™*^^) 
[Step 2], which requires an inner iterative process. Once the NE of Qt{X ,6 ,tt^ ) is reached, the reg- 
ularization parameters (A",7r") are updated according to (162p [Step 3], which represents the outer loop, 
and the new game Qt{X ,0^X^'^^ ,-k^~^^) is played again (if the convergence criterion in Step 1 is not met). 
In practice, however. Algorithm [3] is implementable as a single-loop scheme: the SUs play the game 
^i(A', 0, A", vr"), wherein from "time to time" (more precisely, when a NE is reached within the required 
accuracy) the objective functions of the virtual players are changed by updating the regularization terms 



In order to implement the aforementioned single-scale scheme, the following issues need to be addressed: 
1) How to solve each inner game ^((A', 0, A", vrf ) via distributed best-response algorithms? 2) How to 
update the regularization parameters in a distributed way? and 3) How to check the terminations of 
the inner process in Step 2— the SUs have reached a NE of the game ^j(Af, 0, A", vr") within the desired 
accuracy? We provide an answer to these questions next. 

On the inner game and price/multipliers update [Steps 2 and 3] 

Capitalizing on the solution methods that we developed in Sec. 15.11 for games with exogenous price and 
no side constraints, a natural choice for computing a NE of each ^^(A', 0, A", vr") in Step 2 of Algorithm 



Proof. See Appendix [Fl 



□ 



from f (Aq - A^) and f (vrt - vrf ) to f (Ag - A^+i) and f (vrt - 7rJ*+^), respectively. 
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[3] is applying those best-response asynchronous algorithms to ^((Af, 0, A'^, vrf). For instance, if a Jacobi 
scheme is chosen (cf. Algorithm [T]) , Algorithm |3] reduces to Algorithm 3] below, which sheds light on the 
signaling and complexity requirements of the proposed class of algorithms. In Algorithm U] we use the 
following notation: (x*(A,7fi), A*(A,7ff), 7r^(A,7ft)) denotes the NE of ^^(A', 0, A, vft), and 
is the Euclidean projection onto the interval [0, A™*^^], i.e., [a:;]o™'"' — max(0, min(2;, A™'^^)). 



in m 



Algorithm 4: Jacobi Best-Response-Consensus Algorithm for Q{X,d) 

(5.0) : Choose i) any arbitrary starting point (x^, A(°\ vrf^), with = (f^°\ p^°\ ) G 
y and {X^^\ttI^^) > 0; ii) any regularization tuple (A,7ft) > , and iii) some e G (0,1); set 
n = 0. 

(5.1) : If (x*(A,7ft), A*(A,7ff), 7r*(A,7ft)) satisfies a suitable termination criterion: STOP. 



1 ^ 

(S . 2a) : Run a (vector) consensus algorithm to locally compute the current values of — — 



(n) 



and / (x(")) = ^/^(xj"^) [cf. Algorithm [2] ; 

9=1 

(S.2b) : Update the players' strategies simultaneously: 



fn+l) 



G argmin \ £ 



(x„AS")),xLtvrr)} 



yq = l,...,Q 



An+l) 



a 



(63) 



(n+l) 



I (xW) 



a 



(S.3) : If (^x("+i), A("+^), vrf+^^j is a NE of GtiX, 6, X,Wt), then 
1) update the regularization tuple (A,7f(): 



A = A("+^\ Vg = 1, . . . , Q and Wt = 



(n+l). 



2) set (x*(A,7fi), A*(A,7fi), <(A,7fi)) = (x("+i), A("+i), 7rl"+^) 

3) n ^ ?i + 1 and return to (S.l). 
else: n ^ n + 1 and return to (S . 2a) . 



(64) 



The convergence analysis of the algorithm follows from that of Algorithm [3] (the outer loop) and 
Algorithm [1] (the inner loop) and thus is omitted. It is worth mentionig that Algorithm U] converges 
under similar conditions obtained for Algorithm [T| provided that a sufficiently large proximal gain a is 
chosen; this is not surprising, since the core of Algorithm |4] is the updating rule used in Algorithm [T| 
whose convergence conditions imply those of the outer loop (cf. Theorem llOp . We refer to Sec. IS.ll for an 
interpretation of these convergence conditions. 
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Algorithm [4] is mainly composed of two-subroutines: a consensus-based scheme [Step 2a] and a best- 
response update [Step 2b], both implemented locally by the SUs. More specifically, the inner game 
Qt{X,9,X,Tft) is solved in a fairly distributed way by following a two-steps procedure. Fist, in Step 
2a, the SUs run a consensus algorithm to locally acquire the global information required to perform the 
update of their sensing/transmission variables as well as the multipliers Ag's and the price ttj, which 
is represented by the average sensing time (l/Q) Yl'q=i ^q^'^ / \fTq) ^-iicl the global level of interference 
I(x(")) = Ej=i^g(4"^) generated at the primary receiver; this procedure requires an exchange of infor- 
mation among neighboring nodes, as already discussed in Sec. 15.11 where we refer for details. Once the 
aforementioned information is available at the secondary transmitters, each SU q locally updates his own 
sensing/transmission strategy Xg as well as the multiplier \q and the price vr^, according to (|63p [Step2b]; 
he just needs to measure the MUI experienced at his receiver and solve his own optimization problem. 
Note that: i) the updates of the multipliers Ag's and the price vr^ have an explicit closed form expres- 
sion, and thus are computationally inexpensive; and ii) there is no need of a centralized authority for the 
optimization of the price tt^, which is instead updated locally by each SU. 



On the inner termination criterium [Step 2] 

The only issue left to discuss is how to check the termination criterion of the inner process in Step 2 
of Algorithm [3l similar discussion applies to Algorithm 21 In practice. Step 2 is terminated when the 
NE (x*(A",7rf), A*(A",7rf), 7r*(A",7rf)) of 0, A", vrf ) is reached within the prescribed accuracyQ 

say where we let e^") to depend on the (outer) iteration index n. Stated in mathematical terms, 
this means that the players leave Step 2 as soon as their current strategy profile (x. A, vtj) satisfies the 
following inequality: 

X 

A 



x 



A*(A",7r: 



(A" 



< e 



(n) 



(65) 



where ||«|| is any vector norm. Denoting by z = (x. A, vtj) the players' strategy profile and by Sgj(A"',7r") 
the (unique) NE of Qt{X ,0,\^ ,tt^), which depends on the regularization parameters (A",7r"), the stop- 
ping criterium in (f65l) can be equivalently written as ||z — Sgj(A",7r")|p < Using the above nota- 
tion/terminology. Step 2 of Algorithm [3] reads as 



(S.2a) : Solve the game ^t(^Y, 0, A", vr") within the accuracy e^"): find a z = (x. A, TTf) such that 



Sa(A" 



<,(n). 



(66) 



(S.2b) : Set (x*(A",vrf), A'^(A",^?), <(A", vrf )) = z. 



In general, the test in (j66p would require some coordination among the players; nevertheless, we suggest 
next two simple distributed protocols to do that, building on the error-bound analysis of Vis Ch. 6]. 

Observe preliminarily that an error bound on the distance of the current strategy profile z from the 
NE S(jj(A"', vr") can be obtained by solving a convex (quadratic) problem (see, e.g., [351 Prop. 6.3.1], |35| 
Prop. 6.3.7]). Indeed, under the convergence conditions of Algorithm [3] [cf. Theorem I10|. one can write 



*Recall that, under the convergence conditions in Theorem 1101 each Qt{X, 9, A, 7rt) has a unique equihbrium, 
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each game ^^(A', 0, A", vr") as a (strongly monotone) VI problem, for which the following error bound 
holds [35] Prop. 6.3.1]: a (finite and absolute) constant > exists such that for every z, 



|z-Sg,(A",7rr)f <r/||*Lt(z) 



(67) 



with 



nat 



and n^;, 



Vx,/^g((Xg, Ag), X_g,7rt)) 

^max 

^glXg) 



a 



Q 



g=l 



V 



/(x) 



Q 



(68) 



V [ 



nat 



Z)] 



Q+1 



I denoting the Euclidean projection of the vector a onto the closed and convex set 3^^, where 
in (I68p we made explicit the partition of ^'nat (^) in Q + 1 (vector) components, ([^^^t (^)]<?)^/) each 
of the first Q being associated with one different player q. The important result here is that each SU q 
can compute his own component [^^at '^^^^ ^ the last component [^^at (^)]q+i) efficiently and 

locally. Indeed, capitalizing on the information already acquired for the computation of the best-response, 
he just needs to solve a quadratic programming [corresponding to the evaluation of the projection Ily^ (•)], 
for which no extra signaling/coordination with the others is required. 

A simple application of the error bound (j67p for the test in (j66p is to let each SU q to choose a 
local termination error Eq < 7] ■ e/Q, with e = e^"^ being the desired accuracy in (j67p . and perform the 
termination criterion || ['^'nat + II ['^nat (^)]q+i 11^ — ^<?' which is locally implementable, provided 

that an estimate of the absolute constant ry in (|67p and the number of the active SUs can be preliminary 
obtained. 

When this information is not available, one can consider a variation (inexact version) of Algorithm 
[3l Instead of solving each game ^t(^, 0, A", vr") exactly, the players compute at every stage n only an 
approximated solution of ^^(Af, 0, A", vr") that becomes tighter and tighter as the iteration in n proceeds. 
Stated in mathematical terms, we have that the sub-iterations in Step 2a are terminated according to a 
prescribed error sequence that progressively becomes tighter as the iteration in n proceeds. For 



instance, a suitable termination sequence in (|66p is any {e'"'}„, C [0,oo) satisfying X^^i e'"' < oo; since 
the latter condition implies e^") J, 0, when the iterations n progress the NE Sg^{X^ ,Tr^) will be estimated 
with an increasing accuracy. One can show that the aforementioned inexact version of Algorithm [3] 
converges under the same conditions given in Theorem [TUJ we omit the details because of the space 
limitation, and we refer to |22| for a similar approach valid for convex games. The termination protocol 
for the inexact version of Algorithm [3] is then the following. Each player q choses preliminarily a suitable 



local termination sequence {£q^^}n C [0, oo) such that ^ 
player q becomes then IH^'nat (^)]gll^ + 



.in) 



< oo; the termination criterion of each 



'^nat (^)]q+iII'^ — ^q^\ which Can be locally implemented. Once 
the desired local accuracy is reached by all the players, they can all update the center of their regularization, 
according to (j62p . This protocol guarantees that the resulting sequence e^"^ = Yl^=i ^^q^ ™ ()66p will satisfy 
the required condition Yln^=i ^^"^ < without the need of any information exchange among the players. 



'An explicit expression of r) can be obtained as a function of the system parameters, based on |35l Prop. 6.3.1]. 
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The last issue to address for a practical implementation of the two protocols above is to understand 
how the players can know that also the others have reached the desired termination criterion. This can 
be done by exchanging one bit of information; otherwise each user can just update his regularization after 
experiencing no changes in ||[^nat (^)]gll ||[^nat (^)]q+iII f^^' ^ prescribed number of iterations. 

Two last comments about the proposed class of algorithms solving Q(X,9) are in order. To obtain 
decentralize algorithms even in the presence of global (nonconvex) interference constraints, we have intro- 
duced multipliers and relaxed the global constraints. As a side effect of the proposed approach, we have 
that global interference constraints are met only at the equilibrium of the game; implying that during 
the iterations of the algorithms they might not be satisfied. This issue is alleviated in practice by a fast 
convergent behavior of the proposed algorithms, as shown in Sec. [6l Note that this issue is quite common 
to many power control algorithms subject to QoS or coupling interference constraints (see, e.g., [48) and 
references therein). Finally, we wish to point out that when the sufficient conditions for the convergence 
of the proposed algorithms are not satisfied, still we can claim some optimality property for the proposed 
algorithms, namely: every limit point of the sequence generated by the our algorithms is a quasi-NE of the 
game under consideration; the analysis of such relaxed equilibrium concept along with its main properties 
is addressed in the companion paper |26| . 

5.3 A bird's-eye view 

In the previous three sections we proposed several distributed algorithms to solve the general game G{X, 6) 
and its special cases. The algorithms differ from computational complexity, performance, and level of 
signaling among the SUs; making them applicable to several different scenarios. It is useful to summarize 
the results obtained so far, showing that, in spite of apparent diversities, all the algorithms belong to a 
same family; Figure [2] provides the roadmap of the proposed distributed solution methods along with the 
signaling required for their implementation. 

6 Numerical Results 

In this section, we provide some numerical results to illustrate our theoretical findings. More specifically, we 
first compare the performance of our games with those of state-of-the-art decentralized [21] and centralized 
[14] schemes proposed in the literature for similar problems; such schemes do not perform any sensing 
optimization using thus all the frame length for the transmission, and the QoS of the PUs is preserved by 
imposing (deterministic) interference constraints (we properly modified the algorithms in [14| to include the 
interference constraints in the feasible set of the optimization problem) . Interestingly, the proposed design 
of CR systems based on the distributed joint optimization of the sensing and transmission strategies 
is shown to outperform both centralized and decentralized current CR designs, which validates our new 
formulation. Then, we provide an example of signaling/performance trade-off, showing the throughput 
gains achievable by the SUs if the sensing time is included in the optimization. Finally, we focus on the 
convergence properties of the proposed algorithms. 

Example ^1: Comparison v^ith state-of-the-art algorithms. In Fig. [3l we compare the perfor- 
mance achievable by the proposed joint optimization of the sensing and the transmission strategies with 
those achievable using the sum-rate NUM-based approach subject to interference constraints [14] and the 
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Game g(X,9) [cf. {23)-(24)] 




Game Q^^olX. e) [cf. (48)] 

- only local interference constraints 

- fixed (eqLu)sensing times r^^ = t for all q 



Game Q,(X,e) [cf. (48)] 

- only local interference constraints 

- optimization of the sensing times 



Game Q(X, 6) in the general form (23)-(24) 



■ local/global interference constraints pins pricing 

■ optimization of the sensing times 



Solve the sequence of games in (61) 

Q,(x,e,x°,Til) — > gt{x,e, x°°,Tr^) 

using Algoritlmis 3 (or Algorithm 4) 

- signalhng among the players in the 
form of consensus (Algorithm 2) 



Figure 2: Road-map of the proposed algorithms solving Q[X,9) and its special cases along with the resulting sig- 
nalling/optimization tradeoff. 

game theoretical formulation in |21| . More specifically, we plot the (%) ratio [SRqe — SR)/SR versus 
the (normalized) interference constraint bound pjl'^^^ (p^ = = p for all q ^ r and = /max 

all q), for different values of the SNR detection snr^ = aj ^/c^^, where SRqe is the sum-throughput 
achievable at the (Q)NE of the game Q^^q[X, 6) (local interference constraints only), whereas SR is either 
the sum-rate achievable using the scheme in |14) (red line curves) or the sum-rate at the NE of the game in 
|21) (black line curves). We simulated a hierarchical CR network composed of two PUs (the base stations 
of two cells) and ten SUs, randomly distributed in the cells. The (cross-) channels among the secondary 
links and between the primary and the secondary links are FIR filters of order L = 10, where each tap 
has variance equal to l/L?", the available bandwidth is divided in = 1024 subchannels. From Fig. [H 
we clearly see that the proposed joint optimization of the sensing and transmission parameters yields a 
considerable performance improvement over the current state-of-the-art CR centralized and decentralized 
designs, especially when the interference constraints are stringent. 

Example i^2: Sensing time optimization. Fig. H] shows an example of the achievable throughput of 
the SUs when the sensing time is included in the optimization. More specifically, in the picture, we plot 
the (normalized) sum-throughput achieved at a (Q)NE by one player of the game versus the (normalized) 
common sensing time, for different values of the (normalized) total interference constraint (the setup is 
the same as in Fig. [3]). In the same figure, we plot also the sum-throughput achieved at the (Q)NE of the 
game Q.^=q{X ,9) (square markers in the plot), where c is set to c = 100. According to the picture, the 
following comments are in order. There exists an optimal duration for the (common) sensing time at which 
the throughput of each SU is maximized, implying that the SUs can achieve better performance if some 
(limited) signaling is exchanged in order to optimize also the sensing time. Second, as expected, more 



Solve Qt;=q{X^ 6) using Algorithm 1 



- asynchronous implementation 

- no signalling among the players 



Solve Qt^[X.9) using Algoritlmi 1 

- asynchronous implementation 

- signalhng among the players in the 
form of consensus (Algorithm 2) 
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stringent interference constraints impose lower missed detection probabilities as well as false- alarm rates; 
requirement that is met by increasing the sensing time (i.e., making the detection more accurate). This is 
clear in the picture where one can see that the optimal sensing time duration increases as the interference 
constraints increase. Third, the proposed approach based on a penalty function leads to performance 
comparable with those achievable by a centralized approach that computes the optimal common sensing 
time based on a grid search. 

Example :^3: Algorithms for Q^^^q^X ,6) (local constraints only). In Fig. [5l we plot an instance of 
the sequential and simultaneous best-response based algorithms, proposed in Sec. 15.11 to solve the game 
Qt^{X,6) in (j48p . with vr = (cf. Algorithm [T]) . We considered the same setup as in Fig. 21 but with 
15 active SUs; the SNR detection snr^ = a'j ^/c^yt is set to snr^^ = OdB, for all q and k; the SNR of 
the SUs snrg^fc = Pq/a'^{k) is snrg^^ = 2dB for all q and k, and the (normalized) inter-pair distances 
dqr/dqq > 3 for all q ^ r, with dqr denoting the distance between the receiver of SU q and the transmitter 
of SU r, which corresponds to a "low/medium" level of interference among the SUs; the bounds Uq^k and 
I3q^k are both equal to 0.5 for all q and k] and the constant c is set to c = 100. In Fig. [5)^ a), we plot the 
opportunistic throughput evolution of the SUs' links as a function of the iteration index, achieved using 
the sequential best-response algorithm (solid line curves) and the simultaneous best-response algorithm 
(dashed line curves); whereas in Fig. [5]Jb) we plot the evolution of the optimal (normalized) sensing times 
of the SUs versus the iteration index. To make the figures not excessively overcrowded, we report only 
the curves of 3 out of 15 links. As expected, the sequential best-response algorithm is slower than the 
simultaneous version, especially if the number of active links is large, since each SU is forced to wait for 
all the users scheduled in advance, before updating his own strategy. However, both algorithms converge 
in a few iterations (this desired feature has been observed for different channel realizations), which makes 
them appealing in practical CR scenarios. Observe also that, thanks to the penalty term on the sensing 
times in the objective function of each SU, the algorithms converge to the same optimal sensing time for 
all the SUs [cf. Fig. E^b)]. Roughly speaking, these algorithms share the same features of the well-known 
iterative waterfilling algorithms solving the power control game over ICs |18| [T9l [20l [2H I22| . 

Finally, observe that, even when the theoretical convergence conditions we obtained are not satisfied, 
still we can claim that every limit point of the sequence generated by our algorithms is a QNE of the 
game. 

Example #4: Algorithms for Q{X,G) (global constraints). In Fig. [6] we tested the convergence 
speed of Algorithm 1 applied to the game Q(X,6) in the presence of global interference constraints. The 
system setup is the same as the one considered in Fig. [5] for the low/medium interference regime, with 
the only difference that now, instead of the overall bandwidth interference constraints ([7]) , we assume that 
the PUs impose the global interference constraint ([8]); for the sake of simplicity we considered the same 
interference threshold for both the PUs. In Fig. [6l we plot the opportunistic throughput evolution of 4 
(out of 15) SUs' links and the worst-case average violation of the interference constraints as a function of 
the iteration index (counted considering both the inner and the outer iterations) , achieved using Algorithm 
31 As expected. Fig. [6l shows that the algorithms proposed to solve the game Q{X, 6) with side constraints 
require more iterations to converge that those used to solve the game Qt^=q{X,9). On the other hand, 
global interference constraints impose less stringent conditions on the transmit power of the SUs than 
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those imposed by the individual interference constraints, implying better throughput performance of the 
SUs (at the price however of more signaling among the SUs) 
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Figure 3: Comparison of proposed joint sensing/transmission optimization with state-of-the-art NUM (cooperative) 
and game theoretical (noncooperative) schemes where no sensing is optimized: (%) ratio (SRqe — SR)/SR versus 
the (normalized) interference constraint bound P/I'^^^. 
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Figure 4: Normalized throughput versus the normalized sensing time, for different values of the (normalized) 
interference threshold. The square markers correspond to the (Q)NE of the game G{X, 9), achieved with c = 100. 
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Figure 5: Example of convergence speed of sequential (solid line curves) and simultaneous (dashed line curves) 
best-response based algorithms applied to the game Q-k{X , 6): Secondary users' opportunistic throughput (subplot 
a) and normalized sensing times (subplot b) versus the iteration index. 
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Figure 6: Algorithm 1 applied to game G{X, 0): Opportunistic throughput and average interference violation 
versus iterations (outer plus inner loop) . 
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7 Conclusions 



In this paper, we proposed a novel class of noncooperative games with (possibly) side constraints, where 
each SU aims to maximize his own opportunistic throughput by choosing jointly the sensing duration, the 
detection thresholds, and the vector power allocation over SISO frequency-selective interference channels, 
under local and (possibly) global average probabilistic interference constraints. In particular, to enforce 
global interference constraints while keeping the optimization as decentralized as possible, we proposed a 
pricing mechanism that penalizes the SUs in violating the global interference constraints. The proposed 
games belong to the class of nonconvex games and lack boundedness in some of the optimization variables, 
which makes the analysis quite involved. A major contribution of this paper was to introduce a new 
methodology for studying the existence and the uniqueness of the solution of nonconvex games with 
side constraints and design distributed solution algorithms. The proposed class of algorithms spans from 
noncooperative settings modeling selfish users to cooperative scenarios where the users are willing to 
exchange limited signaling (in the form of consensus algorithms) in favor of better performance. Numerical 
results showed the superiority of the proposed design (in terms of achievable system throughput) with 
respect to the state-of-the-art centralized and decentralized resource allocation algorithms for CR systems. 
Together with their fast convergence behavior, this makes them appealing in many practical CR scenarios. 

Appendix 

A Proof of Proposition [2] 
A.l Intermediate results 

To prove the proposition we need two intermediate results, stated in Lemma [11] and Lemma [12] below. 
Lemma [TT] proves that the Abadie Constraint Qualification (ACQ) holds true at every (nontrivial) optimal 
solution of (I32p . which implies that any of such solutions must satisfy the KKT conditions associated 
with (f32|) . Lemma [T2] proves the boundedness of the multipliers A* associated with the local nonconvex 
constraint /(x*) < at any solution x* of (|32p . 

Lemma 11. The ACQ holds at every feasible solution of problem i32\) . 

Proof. The proof follows similar steps of |26| Prop. 8] and thus is omitted. □ 

Lemma 12. Let x_g G y^q and ivt € St for some t > 0. At every solution x* of I^EW' ^''^V optimal 
multiplier A* associated with the constraint /(x*) < satisfies A < X"^^^, with X"^^^ defined in \42^ . 

Proof. First of all, observe that the nonconvex problem (I32p admits a solution x* = (f^, p*, P^*^, for 
every given x_g E V-q ^iid vrt G St\ by Lemma [TT| x* must satisfy the KKT conditions of the problem, 
for some multipliers A* associated with the constraint /(x*) < 0. Given the KKT conditions (which are 
omitted here), starting from the complementarity of the ^-variables, summing over fc, and invoking the 
orthogonality condition, we obtain: denoting by x* and ^* ^ the multipliers associated to the power budget 
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and the spectral mask constraints, respectively, 
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Y.'^gAPlk^P- 



crl, + Y,\H,r{k)\'pr,k+\H,g{k)\ 

r^q 



(69) 



< 



min < log 

i<fc<Af 



1 + 



V 



al{k) + Y,\H,r{k)\^pr''{k) 



A \ max 



min {(T„u\ 



where in the last inequality we used the following property of the logarithmic function, which is an 
immediate consequence of its concavity: for any scalar a > and c > 0, it holds that log(l + cy) > 
y log(l + ca), for all y G [0, a]. Inequality in (j69p together with the complementarity conditions associated 



to the power constraints p* < p™*^^ and Ylk=iPnk — -^<?' ^^'^ individual nonconvex interference 



constraint /g(x*) < lead to 



N 



\^q + Xqi^q+ Z^^q,kPq,k ^ 



max 
9 



k=l 



(70) 



The desired result A* < X^^^ follows from (j70p and min <^Pq, min^ {p™^^} | = min^ for q. □ 

A. 2 Proof of Proposition [2] 

The proof is organized in the following two steps: 

Step 1. We show first that under the assumptions in the proposition, each problem (132]) has a unique 
optimal solution, for any given x_g € 

Step 2. Then, we prove that any optimal solution of (|32p is nontrivial. 

Step 1. Given x„g G y^g and vr^ G St, let = (r^, p*, Pq^*) be a solution of (f32|) : by Lemma [TT] 
there exists a multiplier A* such that (x*. A*) satisfies the VI(/Cg,Fg) in (j40p : by Lemma [121 it must be 
A* < A™^^. It turns out that to prove Proposition [2] is sufficient to show that, under the condition in the 
proposition, the VI(/Cq,Fq) has a unique solution in the Xg- variables. 

Suppose by contradiction that there are two distinct solutions of the VI(/Cq,Fq), denoted by y^^^ = 
(x^^\ A^^^) G % X [0, A'"^^] and yf ^ ^ (xf \ A^ ^) G x [0, A'"^^], with x^^^ / xf ^ Then, we have 

(yr^-yJ^^)'^F,(y«;x.„ .,)>0 
(y^^^-yf)^F,(yf;x_„ n)>0. 
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Summing the two inequalities yields to 



< 



- v(2) 
J q J q 



Invoking the mean-value theorem applied to to the univariate, differentiable, scalar-valued function 

T 



5 G [0, 1] ^ (y« - yf j Fg (y,(5); x_„ vr^) ; 
we deduce that there exists some < 5 < 1, such that (f7T]) can be written as 



< -fyW-yf) (F,(y«;x„„vr,)-F,(yf;x„„ 



(yj'^ - yf Jy,F, (y,(5); x„„ vr,) (y« - y 



(2) 



X(l) x(2) 



V2 ((X,(5), X_,), TTt, X,i5)) , Vx,/, (x,(5)) 



-Vx,/,(x,(5))^ 



X 



(1) ^(2) 



x(2) 

Af, — An 



(x« - x(2))^ V^^/:, ((x,(5), x_,), vTi, A,(5)) (xW - X, 



(2) 



(71) 
(72) 

(73) 
(74) 

(75) 

(76) 



where in (j73p JygFq( • ; x_q, tt^) denotes the Jacobian matrix of Fq(-;x_g, vrt) with respect to y^ = 
(xg, Xq). Since Xg(5) G (recall that is a convex set) and Xq{6) < A™^^, the inequality in ()76p 
contradicts the positive definiteness of V^^jCg ((xg((5), x_g), vrt, Aq(^)), as assumed in Proposition [2] 

Step 2. To complete the proof it is enough to show that the Pg-component of any optimal solution 
X* = (r^, Pg, Pq^*) of (j32p is such that ^^i^Pqik) is lower bounded by a positive constant; see Lemma [13] 
below. To state the lemma, we need the following intermediate definitions. Let p^'^^ = [p^q]^) G Vq be any 
tuple such that 

J2\^p,qik)\'fq% < 2/^^ (77) 

k 

so that for all pairs (fq, P^) satisfying (jl2p (b). the interference constraints (jl2p (a) evaluated at (r^, p^'^^, P^) 
hold; and let 



' ^ max <^ Q 



(y. 



Uk\l (^q,k + (^g,fc|l " /^g,fc|o) \/fq 



„rcf 



(78) 



Note that, under the feasibility conditions ([25]) . such a Pq'^''" satisfies [see ([T2l) fb)] 

^ ^^q,k\^ ~ f^q,k\0 



<yq,k\l 

for any Tq > sj fq r™'°. Finally, given t > 0, let 



(yq,k\l 



< (Xqh, = I,. . . ,N, 



(79) 



r^fit) A log ( 1 _ !_ ) + log (l - + log ( 5^ r,,, (pf , p 



..max\ 

-9 ) 



t 

2 Vfc^ 



max^{|Gp,,(A.)|V-i}J. 

(80) 



We can now introduce Lemma [13] that provides a lower bound for the optimal sum-power allocation of 
each player. 
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Lemma 13. Given t > 0, and feasible irt G St, P-q G V-q and % G \/ fr r™"^, vTr^^^™^ /o?^ all r ^ q 
the power-part p* o/ any optimal solution of the q-th nonconvex optimization problem in MS^) satisfies 



N 



k=l 



(81) 



Proof Let t > 0, TTt e St, < Pr < P™^"" with r 7^ and tv for r / g satisfying G ^/J^r^, VXr™^ 
be given. Let define 



1 



- ^ E,.., we then have < < ± Ei. ^ < 

Therefore, if x* = (t?, p*, P1^*) is player g's best-response corresponding to vr^, T_g, and p_g, then 



i?, ( rf, (pf , p_,), ") - vr, • J]P,T(ff , ") |Gp,,(fc)| 



2„ref 



< p, (^^, (p^,p-,), p^) - TT, • j;p,x(Tf , pf") lGp,,(fe)pp: 



< log ( j;r,,,,(p^,p_,) J < log ( ^log ( 1 + 1^ ] ] < log ( 5] 



q,k ^ 




1 \ 1 ^ r 



(82) 



where Rq {jq, p, Pj*^) and rg^fc(pq,p_g) are defined in ([5]) and ([6]), respectively. On the other end, we 
have: 

Rq hf, (pf ,p-,), Pf ") - vr, • 5]P°r(ff , Pf ") |Gp,,(fc)pp-f > nf{t), (83) 



with r/^^^(t) defined in ([HOD, and in ^ we used vrt G St and P™;!^^ < 1/2. The desired bound in ^ 
follows readily from ^ and ([83]). □ 



B Proof of Corollary [3] 



The proof is based on the following two steps. 

Step 1. We introduce a symmetric matrix, denoted by V^^/Ig G ]R(^+2)^(^+2), having the property that: 
given t > 0, 

hlCq (x, TTt, Xq)) J > lyf V^T^Iyl V (x, TTt, Xq) € y X St X [0, X"'^^], and y G (84) 



which guarantees that V^^Cq (x, vr^, Xq) is positive definite if V^^Cq is so. 
Step 2. We derive sufficient conditions for 'V^^Cq to be positive definite. 

Step 1. It is not difficult to see that is satisfied if V^^Cq is built such that: for all (x, irt, Xq) G 
y X [0,t] X [0,A'"^^], 



< 



< 



VL A (X, TTt, Ag) 



VlCq (X, TTt, Aq 



if i = j, 
if i / j. 



(85) 
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To construct such a matrix, we need to bound properly the entries of V^^Cq (x, nt, Xq). Recahing that 
V'^^Cq (x, TTj, Xq) has the fohowing expression [cf. (|41|) ]: 

VlCq (x, n, Xq) ^ -VlBqi^) + A, • /,(x,) + ^ • /(x) (86) 



we focus next on each term in (|86p separately. 
—Matrix — ^ij(x): Introducing 



TV TV 



max 



fc=l fc: 

with Vq^kip) defined in i^, — ^ij(x) is given by 
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We provide now some bounds of the above quantities that will be used to define the diagonal entries 
of Vx^jCq. The minimum eigenvalue of the positive definite matrix Vp^ (— logrq(p)) is lower bounded by: 

foranpG^ = n?=in, 



Vp, (-logr,(p))j > ^jnin^ 



^min 
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(92) 
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whereas a lower bound of the first and last diagonal elements in ([88|) are: for all feasible (jg, P^^) [see 
conditions (b) and (c) in (|12|) ]. 
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where we used the following lower bound of P^^: P^^ > min/j Q 



(/^g,fc|l ~ /^g,fc|o) \/^ 



A pfa 



This bounds will be used to define the diagonal entries of the candidate matrix V'^^Cq. 
—Matrix /^(xq). Let introduce first the following quantities and their associated bounds: 
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s> fc = ' _ and Wptapta i. 



pmiss/'^ pfaA 
^g,fc Vg' ^g i 



(97) 

N 



■q, ^ ^ d{p;:'r 

which can be collected in the vectors cjf^ = {^9g,k)k=i^ i^pia — pt^. ^k)k=i^ ^TqP^'^ — ("^r, pf»,fc)fc^=i 

,A/', , \N onri / .max A /, ,max\A'' , .max A (, .max \N , .max A /, .max \N ^?^■nr,^^,r 

^pi-pi- - [^pi-p(-,k)k=i^ and u^^ - {uj^^^f^)^^^, Upf, - [ujpt^i^)f,^^, ^r,pt- " y^fgPt-,k)k=i- ^ maliy, 
we introduce the column vector Gp^q = {\G p^q{k)\'^)^_^ of the cross-channel transfer function between the 
secondary transmitter q and the PU, and the notation a© b = (ofc • bk)k=i for given a = {ak)k=i and 
b — {bk)k=i- Then, matrix Iq{^q) can be written as 



/,(x,) = 2 



l^vect (wr,f, Gp,g p^) , vect(u;y^ Gp,,)^ , l^vect (t^- p^c?) Gp, Pg 
vect (wy,^ Gpg) , OjvxA', vect (^w^^c) Gp,g^ 

l^vect {uJ^^p^,) Gp,g p,^ , vect Gp,^ , I'^vect (u^pMp^ Gp, p. 



(98) 



Based on (I98p . let us introduce the matrix 



Vx^/g(xg) obtained from Vx^/q(xg) by setting to zero 



the diagonal terms ([AJ^g denotes the off-diagonal part of the matrix A) and take an upper bound of its 
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off-diagonal entries (the inequalities below have to be intended component-wise): 

vect(wy^ Gp,q)^ , l^vect {t^~^pM Gp,, p, 
Oatxjv, vect (i^pi^i) Gp,q 

l^vect (i^^f^pi-,) Gp^g pq'j , vect (u}pM Gp,,^ , 
0, vect (ujf]''''^ ^ , l^vect (o;^^^,., p™^'^) 



^2 
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vect (ujf Gp 
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vect 
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off ' 



vect ' - 




(99) 



—Matrix Vx -^(x). Following similar steps as for (|99p . we obtain 



V2^/(x) J^^<max{|Gp,,(A;)|2} 



up 
off ' 



(100) 



We are now ready to introduce the matrix V^^£g satisfying (|85p . Given t > 0, and the definitions in 
Mll-dMl) and (fTOOll . we define 

VlT, ^ Diag{(d?-, {d^fo,rq,k)Li, d^f)} - 2 • max{t, A^-} • max { |Gp,,(fc)p} • [vij,]^ (101) 



Step 2. It follows from Step 1 that V\^Cq in (llOip satisfies the desired property (f84|l . Condition (|43]l of 
the corollary is readily obtained by imposing that Cq is row-diagonal dominant, and setting 
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2 max(t, A" 
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, mm 
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(102) 



N+2j 



} 



□ 



C Proof of Theorem [5] 

To prove the theorem we need the following lemma whose proof follows the same idea of that in Lemma 
[T2] and thus is omitted. 

Lemma 14. Let t > X""^^, with A™"""" defined in (g^. Then, at every solution (x*, A*, vr*) of the VI{Zt, *) 
defined in ((^5[ ) [stationary solution ofQt{X, 9)\, the price constraints i30\) are not binding, i.e., vr* < t. 

Proof of Theorem [5l We prove only statement (a); the proof of the second part (b) follows similar 
steps of those in the proof of Proposition [2] and thus is omitted. Given t > t*, under the assumptions in 
(a), Proposition [3] states that the game Qt{X, 0) admits a nontrivial NE (x*,7r*); by Lemma W\\ there 
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exist multipliers A* such that (x*, A*,7r*) satisfies the Yl{Zt,^) in (j45p [or equivalently (|44p ]. Lemma [T^ 
shows that the upper bound constraint on the price in St is not binding at (x*,A*,7r*), implying from 
iii) of (gH) that ij* = and thus < vr^ _L -/(x"") > 0. Hence, (x*,7r*) must be a NE of the original 
un-truncated game 0) [recall that, under the positive definiteness of the matrices Vx^£g(x, vtj, \q) 

on 3^ X 5f X [0, A™^^], each optimization problem in ([32]), with x_q = x*^ and vr^ = vr^, has a unique 
stationary (and thus optimal) solution, which then must be equal to x*; see Proposition [2] . □ 



D Proof of Corollary [6] 

In order to obtain more general conditions than those in Theorem [S] by Lemma [131 we can restrict the 
check of the positive definiteness of the matrices Vx^>Cq(x, vr^, Ag) and A(x, A, vrt) as required in Theorem 
[5|to the subset of the feasible set where any solution of the game lies. More specifically, let us introduce the 
restriction of the sets Vq and yq defined in ([¥]) and (|19p . respectively, to the power allocations satisfying 
([MI: given t > 0, 
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t A 



y 



t A 




(103) 
(104) 



where 3^* is defined as yq in ()19p . but with Vq replaced by Vq. By Lemma [131 instead of checking the 
positive definiteness of V'^^Cq{x, nt, Xq) and A(x, A, -Kt) on the feasible set 3^ x 5t x [0, A™'^^], we can 
restrict this requirement to the subset y^ x St x [0,A™'*^]. 

We can now prove the corollary. We show next that (j47p are sufficient conditions for the matrix 
A(x, A, TTt) to be positive definite on 3^ x 5^ x [0, A™^"^]. Fist of all, observe that matrix A(x, A, vr^) can 
be written as 
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-A(x,A,7rt)|^^o 



+ c(l-l/Q) 
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-1 








(106) 



up to a permutation 

where Dj^ = diag | (y'^/g)^^!- Since the matrix in (|106p is positive semidefinite, we can focus only on 
A(x, A, 7rt)|^^Q. To obtain a sufficient condition for A(x, A, 7rt)|^^Q to be positive definite on 3^* x 5t x 
[0, A™'^^], we follow a similar idea of that in Corollary [3l Namely, we build a proper matrix A such that, 
for some t > X"'^"^, 



(A(x, A, 7rt)|^^o) y > lyf A|y| V(x, n, X) e y' x St x [0, A"""^], and y G 



f {N+2) 



(107) 
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To this end, we focus on each term in (jlOSp separately and derive proper bounds. 
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Using the fohowing lower bound for the rate function rq(p): given t > and p^ G V\ 
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where the second inequality follows from LemmadH we have for 
and pr G [0, p™'"''] with r ^ q, 
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N 



k=l 



■ vect 



N 



\Hqrik)\'^ Pq,k 



N 



k=1 



•vect 



\HqrikW 



fe=l 



up 



(114) 
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which leads also to 



V^,p, (-logr,(p)) 



< max 



\H,r{k)? 



k=l,...,N a 



1 1 

+ 



max 



q,k 



rl°"(t) r^°^{t) r™n k=i,...,N \ a 



[115) 



q,k 



Using V| Cq defined in (|10ip . we are now ready to introduce the matrix A, defines as: given t > 0, 



A^(A,,)^^^^ with A 



qr 



— Diag ■ 



0, 



V72 n 



up 



, 



if g = r, 
I , otherwise, 



which satisfies the desired property in (I107p . 

A sufficient condition for (I107p can be obtained as in (j47|l . by imposing that (the symmetric part of) 
A is row diagonal dominant. More specifically, introducing 



^117) 





\ 1 


I 1 1 




max < 


max < 






q=l,...,Q 


k=l,...,N 







the diagonal dominance conditions is: for each q = 1, . . . ,Q and i = 1, . . . , N , 



r^q i=i 





up / 










+ ( 




up^T' 


< 













> m 

2 ^ U=i 

r^q 



max 



Hqr{k)\' 



,N a 



max 



\Hrq{k)\' 



q,k 



k=l,...,N a 



r,k 



up 



After substituting the explicit expression of 
the desired condition (j47p . where we defined 7^ ' as 

^(2) A ^max(,^ . ^(1) 

with jq^^ given in ()102p and 

at) 1 



(118) 

and doing some manipulations, (jllSp leads to 

(119) 



cr^'it) - 



2t 



where [V^ Iq]^^ and ({t) are defined in (j99p and (|117p . respectively. 



N+2j 



(120) 



E Convergence of Asynchronous Best-Response Algorithms for Q^i^, 

In this section, we study the convergence of asynchronous best-response algorithms solving the game 
Qt^{X,6) in (|48p : an instance of such algorithms is represented by Algorithm [TJ Since the study of 
convergence is based on contraction arguments of the best-response map associated with game Q-,^{X,9), 
we derive first sufficient conditions for this best-response to be a contraction; see Sec. lE.ll We then 
provide the main theorem stating convergence of the asynchronous best-response algorithms; see Sec. IE.2I 
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E.l Contraction properties of the best-response of Qj,{X,0) 

Before introducing the main result of this section, we need the following intermediate definitions. Given 
Cq defined in ([37|) . let Bg (x, Ag, nt) be the 2x2 matrix, defined as 



Bq (x, Ag,7rt) 



V^ Cg (X, TTt, Xg) 



c=0 



^l(p„pfa)^^ ^'/) 



g ; '1 



V^p^ pfa):^^/^g (x, TTt, Ag) , Aleast (V^p^ pf^^£g (x, TTt, Ag) 



(p. 



r 



(121) 



where ||A|| = p {^A^^ A)^^^ and Aieast(B) denote the spectral norm of A and the minimum eigenvalue of 
the symmetric matrix B, respectively. Given t > and 3^* as defined in (jl03p (cf. Appendix [D]), we also 
introduce 



Pg{t) 



0, 



min {Aieast (Bg (x, Ag, vrt))} , if pg(t) > 0, 

(x Ag)G5?*x[0,A--] 
(x_g,7rt) G y^g X St 



(122) 



otherwise; 



and the diagonal matrices Dg(t, c) and Eg^ (x) 



Dg(t, c) 

'O ' Pg(t) 

with ^g(-) defined in pT|) . Given the coefficients 



and Egr (x) 



0, 



V2 

PgPr 



^,(x) 



(3qr(t, c) = max ||Dg(t, c) ^ Eg^ (x) Dr(t, c) ^1 

{Xg,X_,)Gy^Xy_, 



(123) 



(124) 



for r,q = 1, . . . ,Q and r ^ q, we can finally define the Q x Q matrix T{t) that plays a key role in studying 
contraction properties of the best-response map associated with the game Q-,^{X,6): 



g,r 



-I3qr{t, C), 



ii r = q, 
otherwise. 



(125) 



It is important to remark here that the off-diagonal entries of the matrix T{t) depend, among other 
quantities, on the cross-channels ||ffg,r(/c)p| and ||Gp^g(A;)p}. Roughly speaking, this dependence is 
such that the (3gr{t, c)'s tend to decrease as the aforementioned cross-channels decrease, meaning that the 
/3gr{t, c) remains "small" as long as the overall MUX in the system remains "small". We will show shortly 
that this is what one needs to guarantee the convergence of the distributed best-response based algorithms 
introduced in Sec. 15.11 More formally, by postulating that T{t) is a P-matrix, Theorem 1151 below states 
the contraction properties of the best-response mapping of the game Gwi^, d) with respect to the suitably 
defined block maximum norm [see proof of the theorem for details] . 



Theorem 15. Given the game Q.,^[X , 9) with exogenous (fixed) price vr > 0, suppose that T{t) in il25\) 
is a P-matrix. Then the following hold: 
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(a) Each nonconvex optimization problem in 1^4^ ^ unique (nontrivial) optimal solution Bq{x^ 
|^7^(x_g), p^(x_g), P{"-*{x_g)^, fov cvevy given x^g G y^q and tt > 0; 



(b) The best-response map 3^ 9 x — ?• ^(x) = (Bg(x_q))^_-|^ is a block- contraction; the unique fixed-point 
of B is the unique x-component of the NE of the game. 

Proof. To prove contraction of the best-response, we need to specify first under which norm the best- 
response map contracts. We wih use the following norms: the block-maximum norm on 

^Q(7V+2)^ defined 

as [ig 

lly|lMock=.ma"^^, for y = (y.)^, G mO(^+2), (126) 

where ||-||^ is a valid vector norm on M^+^ and w = [wi, . . . ^wqY > is any given positive weight 
vector. In particular, we choose ||-||^ as follows: partitioning the vector y.j G as y^ = yi,2:Af+2)j 

with yi,2:Ar+2 (or yi^i) being the [N + l)-length vector containing the last + 1 components (or the 
first component) of yj, and given the matrix Dj(t, c) as defined in (|123p . let the vector norm ||-||j be 
lly|li - lly^2:Af+2|l2)||D,{t,c)2' ^here ||x||j5,(j^^)2 = ||Di(t, c)x||2. As it win be clarified shortly, the 

choice of such a norm is instrumental to obtain convergence conditions that can be satisfied for all ranges 
of c > 0. We also need to introduce the (weighted) maximum norm on MP , defined as |49) 

l|xC,vec=.max^, for x G R^; (127) 
' 1=1,. ..,Q Wi 

and the matrix norm IHI^mat M'^^'^ induced by |H|^ ^ec ' given by |49) 

1 

l|A||^,^at = niax-5]|[A],,>„ for A G M«x«. (128) 

We are now ready to prove the theorem. 

(a) : Given t > 0, the P property of matrix T{t) implies Pq{t) > for all q, and thus Vx^£g(x, tt, Xq) >- 
for all (xg, Xq) G X [0,A™^^], x_g G y~q, and vr > 0. According to Proposition [21 this guarantees the 
uniqueness of the optimal solution Bq{'x.^q) = (-T^(x_q), p*(x„g), Pj'^*(x_g)) of each nonconvex problem 
in (jHI), for every given vr > and x_q G 3^-<j. 

(b) : Given the unique solution Bq{x^q), by Lemma[TTl it follows that there exists a multiplier Xq associated 
with the nonconvex constraint /g(xg) < such that the tuple (i3g(x_g), Ag) satisfies the KKT optimality 
conditions of the optimization problem in (|48p , or equivalently, the VI(/Cg,Fq) defined in (j40p . which we 
rewrite here for the reader's convenience: 

1 T 



Jq - Bq{x^ 
Xq — Xq 



Vx,A {{Bq{x_g), Xq), X_g,7r) 
-Iq (Bqix^q)) 



>0, yiyq,Xq)eyqXR^, (129) 



with V^gCq defined in ([37]). Recah that A, G [0, A™^^] (Lemma[l2|) and Bq{x^g) G 3^* (Lemma[l3|). 
Gonsider now two feasible points x(i) ^ (xJ^))J^i, x(2) 4 (xf )J^i G y, with xf ^ (t^'\ pf , P^^'^ 

for i = 1, 2, and q = 1, . . . ,Q, and let A„ 's be the multipliers associated with the nonconvex constraints 
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{Iq{xq) < 0}'s at the optimal solutions Bq{x 



^^(x^)^), p*(x«),Pj-(x«)), for i = 1,2. Eval- 



uating (jl29p first in the solution (;Sq(x^^), Ag^"*) given (y^, Xg) = (;Bg(x^g), A^^''), then in the solution 
(x^g ) , Ag^^ ) given (y^, Xg) = (Bg^x^lg), X^^^), and summing the resulting inequalities, we obtain 



> 



?,(X«) - Bgi^^^l) 

Y(1) t(2) 



T 



V^^Cg[{Bg{x^ll),xf), xW,vr) - Vx,£, ((fi,(xL^^),A-0, x% 



'n ^^(2)^ t(2)a ^(2) 



vr 



(130) 



By the main-value theorem we deduce that there exists a (5 G (0,1) and a pair (xg((5), x_g(5), Ag((^)) 
<5 • fe(x«),xL'J, aJ'^) + (1 - 5) • fe(xL'h,xL'], Af ) such that 



> (B,(x«) - fi,(xL'h) ((x,(5), A,(<5)), x_,(5),vr)) (fi,(xL')) - fi,(xL'h 



T , 



e,(xL'^) - Bgi^^ll)) (^^('^)' ^-'?('^)) - ^ 



(2) 



(131) 



Using the definition of Bq(x, vr^, Ag), /9g(t), ^g"^, and T)q{t, c) and E^g (x) as given in (|121|) . (|122p . (|123p . 
and (jllSp . respectively, let us introduce for each q = 1, . . . ,Q, the error vectors: 



G f2 



?J(xL'J)-^^(x 

p^(xL'))-p^(xL^), 
pf-(xL'h-p^(xL'h 



and e„ 



.(2) _ ;^(1) 

q iq 

(2) (1) 
Pq -Pq 
pfa(2) pfa(l) 



(132) 



and the matrices 



CJx(<5),A,(<5),^) 



V|£,((x,(5), A,(<5)),x_,(5),7r), 



V2 



V|^(p^ p,.)A((x,(<S), A,(,S)),x_,(5),7r) 



- £q ((xg(5), A5(,5)), x_5(5), tt) , Aniin (v2p^^^,,,j/:^ ((xg(<5), A5(,5)), x_5(5), 7r)j 



B, (x(5),A,(5),7r) 



1-1/Q 



^ D,(i, c)2 



(133) 



and 



Eg, (x(5)) 



< 



V|.eg(x(5)) 



V^,p,.0g(x(5)) 



0, 



max 

k=l,...,N 



\Hqrik)\' 



a 



q,k 



1-1/Q\ (1/Q 

VTq i KVTr 



fSUp 



■psup 



V2^p^(-logrg(p(5))) 



(134) 



where the upper bound in (I134p follows from Lemma [T3l and ( IllSp . Then, from inequality (I13ip . we deduce 



e| Cq{x{6),Xg{6),7r) e^^ < e| ^Eg,.(x(5)) e,.. 



(135) 



r=/=q 
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which, using the bounds in (|133p and (|134p and the definition of f3qr{t, c) in (jl24p . leads 

Bg{t, c)ei^^ < J2 C)"^ Bqr (x((5)) Drit, c)~^|| \\T>r{t, c) BrW^ < J^Mt, c) ||Dr(t, c)e^||2 , 



for all g = 1, . . . , Q (the inequality in (I13ip is trivially satisfied if Dg(t, c) e- 



(136) 

0). Introducing the 



vectors e- 



2?,D 



D5(i,c)y 1 



and 



Q 



■'9IID ft cW ' ^^'^ matrix E(t) = I — T{t), the set of 



inequalities in ()136p can be written in vectorial form as 

eB,D<E(i)eD, Vx«,x(2)g3;, 
and thus, for any given w > 0, we have 



(137) 



B(x«) -B(x 



block 



< IIE 



IE 



block 



loo,mat II^D |loo,vec 1 1 "^V"',' I loo, mat 

DlOCl 

(138) 

for all x'^^^x^^^G y. To complete the proof we need to show that ||E(t)||^ j^^^^. < 1 for some w > 0. 
Invoking Lemma [Mj Lemma 5.2.14] and |42[ Cor. 6.1], we obtain the desired result: 



r(t) is a P-matrix ^ 3w>0 such that = ||E(t)||^ < 1. 



(139) 



□ 



E.2 Asynchronous convergence theorem 

Convergence of best-response algorithms solving the game Gtt{X, 6) follows readily from the block-contraction 
properties of the best-response, as proved in Theorem [15] and is thus guaranteed under the same conditions 
given in Theorem 1151 



Theorem 16. Given the game Q-,^[X ^ 0) with exogenous (fixed) tt > 0, suppose that T{t) in il25\) is a 
P-matrix. Then, any sequence generated by the asynchronous algorithm based on the best-response B and 
starting from any point in y converges to a NE of the game, for any given updating feasible schedule of 
the players. 



E.3 On the contraction/convergence conditions 

We derive here easier conditions to be checked implying those in Theorem [16] (and Theorem [15]) ; this sheds 
light also on their physical interpretation. The approach is similar to that followed to prove Corollary [6] 
we thus provide only a sketch of the proof. 

The main idea is to build a matrix, say r'°™(t), such that T{t) > r^°™(t) [the inequality has to be 
intended component-wise], implying that if r^°™(t) is a P matrix, then T{t) is so [34j, which is the condition 
required by Theorem 1161 Then, we provide sufficient conditions for r'°™(t) to be a P matrix. 

To obtain such a r'°™(t), it is sufficient to properly upper bound (the modulus of) the off-diagonal 
entries Pqr{t,c) of T{t). Given the expression of /3qr{t,c) [cf. (|124p ]. a way to do that is to find a matrix 
B;^°" such that Bg (x, Xq,7rt) > B^°", and a diagonal matrix D^°™(t,c) such that Bq{t,c) > Dj^°^(t,c), 
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where Bg (x, A^, vrj) and Dg(t, c) are defined in (jl2ip and (jl23p . respectively. Skipping tedious intermediate 
derivations, we obtain the following 



B 



low A 



Ale 



V2 £ 



2:N+2 



(140) 



where d?™ is defined in (193 



V2 £, 



2:7V+2 



denotes the (A^ + l)-dimensional lower right block of the 



matrix V|^£q defined in (jlOip . and <iq^{t) is given by 

= 2 maxit, A"^"n • max ^ ''^^'"(^^l' 

^ fc=l,...,Af 



vect (^^^S^^'^'j 

l^vect (cj^^^, p)^'") 



(141) 



with cjS^'^^ and cj^^^^f^. defined in ()94p and (j96p . respectively. Note that, since the following bomids hold 



between the entries of (x, Ag, tt^) and B 



low. 



V%2Cq (x, TTt, Ag^ 



> d2 



c=0 



V4(x„Pfa)/:,(x,A„vri) <,-P(t), 



> least 



V^^_^ pfa)/^g (x, TTj, Aq)j > Al, 



'least 



V2 C 



2;Af+2 



matrix B^ satisfies the desired property Bg (x, \q,7rt) > B^ . 

Finally, using Bg°™, we can introduce a lower bound of the quantities Pg{t) in ()122p 



p|,°"(t) = Aleast (Bl°-) , if pl°-(t) > 0, 

0, otherwise, 



(142) 



and define the matrix (t, c) as 



D|°"(t,c)2 4 



pi°-(i) + c 



1-1/Q 







(143) 



which satisfies Dg(t,c) > Dg°™(t, c). Using the above matrices, the desired upper bound f3qr{t, c) of the 
coefficients f3qr{t, c) is 

/3^nt, c) ^ ||D^°-(i, c)-iE™PDi°-(t, c)-M| 



max < 



(1 - i/Q)^ 



{t)fr 

(1 - 



max 
k=l,...,N 



+ C 



a 



q,k 



S9 



pi°-(t)vVF(i) 



^ > /3gr(t, C), 



(144) 
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with ^g"^ and EglJ^ defined in (jllSp and (jl34p , respectively. Using these quantities it is not difficult to see 
that the matrix r'°^(t) defined as 

rj.low(^)i A 1 1' iir = q, 

L V !\q^r I -/3^P(t, c), otherwise, ^ ' 

satisfies the desired property V{t) > r^°™(t) for any t > 0. 

Since r'°'*(t) is a P matrix if and only if p {l - T^""" {t)) < 1 [Ml Lemma 5.2.14], imposing that 
I — r^°™(t) is row or column diagonal dominat, leads to the desired sufficient conditions guaranteeing 
convergence of asynchronous algorithms based on the best-response B. This is made formal in the corollary 
below. 

Corollary 17. Statements in Theorem\l^ (or Theorem \15\) hold true if one of the two following conditions 
is satisfied: 

- Low received MUI.- for all q = 1, . . . ,Q, 

^/3gT(t, C) < 1, ^^4g^ 



- Low transmitted MUI.- for all r = 1, . . . ,Q, 

^^,T(*,c)<L (147) 

The physical interpretation of the above conditions is similar to that given for the existence/uniqueness 
of the NE (cf. Section l4.3p . Roughly speaking, conditions (|146p or p47p require "low" interference in 
the network, meaning "small" values of the (normalized) cross-channels \Hqr{k)\'^ /^^ f, as well as large 
values of coefficients p^g'^it), which is met if, among all, the (normalized) cross-channels \Gp^q{k)\'^ / 1'^'^^ 
are "sufficiently small". An illustrative example is obtained in the two opposite cases where there is no 
optimization of the sensing times (and thus c = 0) or the sensing times are optimized by imposing a 
common optimal sensing time by choosing a (sufficiently) large constant c (and there are many active 
SUs). For those two cases, conditions (|146p and ()147p reduce respectively to 



T.,S'^:J^-^^\-r,r<h and Y,,S-^,V-^^\y,.<h (148) 

ry^q y I''' ) q^r K 1''^ ) 



2 



with 



Iqr 



^sup 



Note that 7gr's, among all, depend on the cross-channels |Gp.q(fc)p//™^^, and become "small" when 
|Gp^g(A;)p//™'^^ are small. Conditions (jl48p are thus satisfies if there is not "too much" interference in the 
system. 
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F Convergence of Best-Response Algorithms for Q{X,6) 



F.l Proof of Theorem dU] 



First of all note that, given A*^ > and vr^ > and under the setting of Lemma[71 the game Qt{X , 0, A'', 7rf ) 
has a unique NE, denoted by (x*(A'^, vrf), A*(A°, tt^), 7r^(A°, 7r°)) , where we made explicit the dependence 
on the regularization tuple (A°,7rj0). This makes the sequence {(x*(A", ttj"), A*(A", vrH, <(A", 7rn)}^=o 
generated by Algorithm [3] well defined. The uniqueness of the NE of Qt{X,6,\^,TT^) can be proved by 
exploring the connection between the game and a suitably defined VI, as briefly outlined next. Under the 
positive definiteness of matrix A (x, A,7rt) (as required by Lemma [7|, Qt{<^ , ^, A*^, vr^) is equivalent to the 
W[Zt, ^ xo ^o), with Zt given in (j45p and the VI function ^_^o ^o(x, A,7rj) defined as 



*;,o^^o (x. A, TTt) = *(x. A, vrj) + e • 




Q(Ar+2)xl 

/ AO 







(149) 



In other words, the VI(2t, ^_^o^o) is obtained by the VI(2t, ^) in (j45p introducing the proximal regu- 
larization of some of the VI variables, namely the A-variables and vr^-variable. The Jacobian matrix of 
^_j^o .n-o(x. A, VTf) denoted by J^'_;^o .^o(x. A, Tit) is 



A(x,A,^t) Vxl(x) Vx/(x) 
-VxI(x)^ e • I 



-V./(x 







(150) 



where Vxl(x) = Vx[/i(xi), • • • , /q(xq)]. If A (x, A,7rt) is uniformly positive definite, then so is (x, 
A, vr^). It turns out that, under the setting of LemmaO the regularized Yl{Zt, ^ x° tt") strongly monotone 
|35| Prop. 2.3.2(c)] and thus has a unique solution |35| Th. 2.3.3], implying the uniqueness of the NE 
(x*(AO, vrO), A^(A0, tt?), <(A0, tt?)) of ^^(Af, 0, A", vr?). 

Once we have proved that (x*(A, vr^), A*(A, ttj), 7r*(A, vr^)) is well defined for any given A > and 
T^t ^ 0, we can derive the main properties of such a tuple [interpreting its components as functions of 
(A,7rt)], along with its connection with the NE of the game Gti^, ^) [and thus G{X, 9)\; these properties 
will be instrumental to prove Theorem 1101 

Proposition 18. Given t > 0, suppose that A(x, A,7r() in is uniformly positive definite for all 

(x, XjTTt) & y y. [0, X^^^]Q X St, and let e > be given. Then the following hold: 

(a) The mapping associated with the X-components and tt- component 0/ (x*(A, Tr^), A*(A, vrt), 7r*(A, ttj)), 
i.e., 

' A^fO 

has a fixed point, and it is nonexpansive on [0, A™'^^]'^ x St; 

(b) The mapping associated with the x-components 0/ (x*(A, Tr^), A*(A, vr^), 7r*(A, vr^)), i.e., 



\;^\M:[0,A-]«x5,B(A,vr,)^f ^^J^'^'l) 



fl511 



x' 



(152) 
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is Lipschitz continuous on [0, X"^^^jQ x St, i.e., there exists a constant < u < +00 such that 

||x'^(A«,vrf))-x'^(A(^\vrf )|b < HKA^, " (A^^^ (153) 

for all (A(i),7rJ^)), {X^^\4'^^) G [0,A'"^"]Q x 5j; 

(c) For any fixed-point (A,7f() G [0,A™'^'^]'5 x St of {X* {■) , tt* {■)) , the tuple (x*(A,7f(), \,7ft) is a solution 
of the VI{Zt, *); therefore, it is a NE ofQti^, 0). 

Proof. We prove next only (a) and (b); (c) follows similarly. 

(a) Let (x. A, 7f() € 2^ be a solution of the VI(2j, ^) in (|45p . whose existence is guaranteed by Lemma[71 
recall that, by LemmalU it must be (x,A,7ft) G y -><[{), y'''''']^ x St- It follows that: i) (x*(A,7ft), A*(A,7ff), 
7r*(A,7ft)) is the unique solution of the W{Zt, *X7ft)' (^) A,vft) is also a solution of VI(2^t, ^'■^-J. 

Hence, it must be x*(A,7ff) = x, A*(A,7ft) = A, and 7r*(A,7ft) =7fj, which implies the existence of a 
fixed-point of the mapping (A*(-), 7r*(-)) in (I15ip : moreover, since (x, A,7rt) G 3^ x [0, A™^^]*^ x St, such a 
fixed point is in [0, X^^^]Q x St. 

We prove now that (A*(-), 7r*(-)) is nonexpansive on [0, A™^^]*^ x St. Given two distinct tuples 
(A(i),7rf ^), (A(2),^f )) G [0, A"^^^]Qx5f, by definition, the tuples (x^(A«, vrf ^), A*(A«, ^), 7ri^(A«, ^)), 
with i = 1,2, satisfy the following: 



/ x-x*(A(^),7rf^) 
A-A*(AW,vrP) 
V ^i-<(A«,vr«) 



V.,£, fx^(A«,vr«), A^(A»,vr«),<(A«,vr« 



^ (/,(x*(A»,vr« 
^ /(x^(A«,vr« 



9=1 



9=1 



A'^(A«,7rl*')-A« 
vr?(A«,vr?)-vr« 



> 0, (154) 



for all (x. A, vtj) G 3^* x [0, A™'*^]'^ x St and i = 1, 2. Thus, similar to the proof of Theorem 1151 we deduce 



/ x^(A(2),vrf))-x^(A«,vr«) 
A*(A(2),vrf )-A'^(A«,7rf)) 
\<(A(^UfV<(A(^\vr«) 



/ (v.,£,(x*(AW,vrf)),A^(AW,.f)),<(A«,.«)))^^^ \ 



v 



v 



/fx^(A(2),vrf ) 



9=1 



+ e 



A^(A«,^«)-A« 

<(A«,vrr))-.r) 



+ 



/ A*(A(2),vrf))-A(2) 
<(A(^),vrf))-vrf) 



f/,fx^(A«,7r 



(1) 



1^ /(x'^(AW,vrW) 



9=1 



/J 



> 0. 



(155) 

By the mean-value theorem, it follows that there exists a tuple (x^, A^, vr^) lying on the line segment joining 
(x^(A«,7rf)),A^(A«,7rf)),<(AW,7r(^))) and (x^(A(2), vrf )), A*(A(2), vrf )), vr,*(A(2), vrf )) such that [see 
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also (fT50]l ] 

(x'^(A(2),.f ) - x'^(A«,vr«))^ A(x,, A,, vr,) (x'^(A(2), vrf )) - x*(A(^),vr«; 



< e- 



A^(A(2),vrf))-A*(A«,vr«) 
<(A(^),.f)-<(A«,-l^^) 



T 



A(2) _ Ad) 



vr. 



(2) 



vr. 



(1) 



e • 



A^(A(^),vrf)-A^(A«,vr«; 
<(A(^),.f)-<(A«,vr«: 



2 

(156) 



Applying the Cauchy-Schwartz inequality and reorganizing terms we obtain: 



A*(A(2),vrf)-V(AW,vrl 



-(A(2), 



(2)n 



<(A« 



< 



A^(A(2),vrf))-A*(AW,vr, 



<(A(2),vrP)-<(A«,vr, 



t I 









2 


Up 





i (x*(A(2),vrf ) -x*(A(i),vr«))^A(x,, A,, U[}P-\.f)-^^{X^\.\ 



< 



A^(A(2),^f )-A*(A«,7r} 
<(A(^.f))-<(A«,vrf)) 





/ A(2) 


- A'") \ 


2 




-1" J 



where the last inequality follows from the positivity of the quadratic form, due to the positive definiteness 
of A(x5, A5, TT^); which proves the desired nonexpansive property of the mapping (A*(-), 7r*(-)): 



A*(A(2),vrfVA*(A«,7r, 



<(A(2),vrP)-<(AW,vr^ 



(l)^ 
t , 



< 



A(2) _ Ad) 

(2) ^(1) 



vr. 



(157) 



(b) Following similar steps as in (a) and using the Cartesian structure of the set Zt we deduce that, for any 
given (Ad),7rd)), (A'^^), vr^^^^) G [0, A™'^"^]*^ x5t, there exists a tuple (x^, A,^, tt^) lying on the segment joining 
(x*(Ad),vrf)), A*(A«,7rf)),<(A«,7rf))) and (x*(A(2), vrf )), A*(A(2), vrf )), vr?(A(2), ))) such that 

T 



x^(A(2),vrP) -x^(A«,vrW) A (x„ A„ vr,) x^(A(2), vrf) - x^(AW, x 



< 



x*(A(2),vrf)-x^(A«,vr, 



||[VxI(x,),VxI(x, 



/ API 


- Al') \ 







(158) 

where the last inequality follows from the Cauchy-Schwartz inequality and the nonexpansive property of 
(A*(-), 7r*(-)) [cf. (jl57p ]. and Vxl(x) = Vx[/i(xi), • • • ,Iq(xq)]. Invoking the uniform positive definiteness 
of A( X,, A,, vr,) and the boundedness of the set 3^, we deduce from ()158p 



x^(A(^),.f))-x^(AW,.d)) 



< 



||[VxI(x,),VxI(x,) 



A(2) _ Ad) 



vr. 



(2) 



vr. 



(1) 



< u ■ 



Aleast (A(x,, A,, vr,)) 

for some finite positive v, which proves the desired Lipschitz continuity of x*(-) on [0, A 



A{2) _ Ad) 

-(2) _ ^(1) 

(159) 
xSt. □ 
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Proof of Theorem IIOI We are now ready to prove the theorem. The outer loop of Algorithm [10] [see 
(|62p in Step 3] is an instance of the Jacobi Over Relaxation, JOR, method |5U| applied to the mapping 
(A*(-), 7r*(-)); which, using the notation introducing above, can be equivalently rewritten as: 



I =^1-^)- Sn) x(n) Jn) ■ (160) 



Since (A*(-),<(-)) is nonexpansive on [Q,^^'''']^ x St [Proposition [HJ^a)], the sequence {(A^"), 7rJ"^)}~=i 
generated by the JOR scheme (jl60p converges to a fixed-point (A,7fi) of (A*(-),<(-)) EOl Th. 12.3.7]. 
By Proposition [T51f b) [see (|153p ]. the convergence of {(A^"\ tTj""^)}^^ implies also the convergence of the se- 
quence {x*(A("),7rJ"^)}^^i in the inner loop of Algorithm[TO]to x* (A, Tit) ; the limit point (x* (A, tt^) , A, TTf ) 
is the claimed NE of QtiX, 6) [Proposition [HJ^c)], and thus g{X, 6), if t > A'"'"^ (Theorem [5]) . □ 
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